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(57) Abstract: HCV variants are described. The variants include polynucleotides comprising non-naturally occuiring HCV se- 
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HCVVARIANTS 

Background of Ae Xovention 

Reference to Goveniment Grant 
.5 This invmtion was made with government support under Public Health S^irice 

Grants CA S7973 and AI 40034. The government has certain rights in this invention. 

Background of the Invention 

10 (1) Field of the Invention 

The invention relates to materials and methodologies relating to &e production and 
use ofhepatitisC virus (HCV) variants. More q)ecifically,HCV variants are provided that 
are usefiil for diagnostic, flierapeutic, "i^ccines and other uses. 

IS (2) Description oftihie Related Alt 

Brief general overview of hepatitis C virus 
After the development of diagnostic tests for hepatitis A virus and hepatitis B virus, an 
additional agent, which could be experimoitally transmitted to chimpanzees [Mtsr et al.. 
Lancet 1, 459-463 (1978); HoUinger et al., Intervirology 10, 60-68 (1978); Tabw et al,, 

20 Lancet 1, 463-466 (1978)], became recognized as die major cause of transfusion-acquired 

hepatitis. cDNA clones corresponding to the causative non-A non-B (NANB) hepatitis agent, 
called hepatitis C virus (HCV), were reported in 1989 [Choo et al,. Science 244, 359-362 
(1989)]. This Ineakthrough has led to rapid advances in diagnostics, and in our understanding 
of the epidemiology, patihog^esis and molecular virology of HCV (For review, see Houghton 

25 et al., Curr StudHematol Blood Transfus 61, M 1 (1994); Houston (1996), pp. 1035-1058 
in FIELDS VIROLOGY, Fields et al., Eds., Rav«i Press, Philadelphia; Major et al., 
Hepatology 25, 1527-1538 (1997); Reed and Rice, pp. 1-37 in HEPATTITS C VIRUS, 
Reesink, Ed., Karger, Basel; Hagedom and Rice (1999), TEIE HEPATITIS C VIRUSES, 
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Springer, Berlin). Evidence of HCV infection is found throughout the world, and the 
prevalence of HCV-specific antibodies ranges from 0.4-2% in most countries to more tiian 
14%inEgypt[Hibbsera/.,y./i!/:Z)u. 168,789-790(1993)]. Besides transmission via blood 
or blood products, or less frequently by sexual and cong^tal routes, sporadic cases, not 
S associated with known risk &ctors, occur and accoimt for more than 40% of HCV cases 
[Alter ei aL, Jl Anu Med. Assoc 264, 2231-2235 (1990); Mast and Alter, Semin. Virol 4, 
273-283 (1993)]. Bofections are usually chraoic [AUbex et al., K Eng. J. Med. 327, 
1899-1905 (1992)], and clinical outcomes range from an inai^arcnt carrier state to acute 
hepatitis, chronic active hepatitis, and chifaosis which is strongly associated with the 
10 development of hepatocellular cannnoma. 

Although interferon C[FN)-a has been diown to be us^^ 
minority of patimts with dmmic HCV infections [Davis et aL, N. En^ J. Med 321, 
1501-1506 (1989); DiBiscegUe et aL, NewEngL J. Med. 321, 1506-1510 (1989)] and 
subunit vaccines show some promise in flie chimpanzee model [Choo et al., Proc Nad. Acad. 
15 USA 91, 1294-1298 (1994)], future efforts are needed to develop more effective 

therapies and vaccines (See, e.g., Tsambnas et al., 1999, Hq;)atitis C: Hope on the Horizon, 
Hq>atitis C Synq>05ium of 37^ Annual Meeting of the Infi^^ous Diseases Society of 
America, reviewed at 

http://www.medscape.com/medscape/cno/1999/IDSA/Storyxfin?story_id===^ The 
20 considerable diversity observed among different HCV isolates [for review, see Bukh et cd., 
Senu Liver Dis, 15, 41-63 (1995); Fanning et aL, 2000, Medscape Gastroenterology 
2:mgi6558.£Bam], the emergence of genetic variants in chronically infected individuals 
[Enomotoe/ai, J. Hepatol 17, 415-416 (1993); Hijikataer a/., Biochem. Biophys. Res. 
Comm. 175, 220-228 (1991); Kato et al,. Biochem. Biophys. Res. Comm. 189, 1 19-127 
25 (1992); Kato et al., J. Virol 67, 3923-3930 (1993); Kurosaki et al, Hepatology 18, 

1293-1299 (1993); Lesniewski et aL, J. Med. Virol 40, 150-156 (1993); Ogata et al., Proc. 
Natl Acad. Sci. USA 88, 3392-3396 (1991); Wetner et al., rirology 180, 842-848 (1991); 
Weiner et al, Proc Natl Acad. ScL USA 89, 3468-3472 (1992)], and tiie lack of protective 
immunity elicited after HCV infection [Farci et al. Science 258, 135-140 (1992); Prince et 
30 al., J. Infect Dis. 165,438-443 (1992)] present major challenges towards these goals. 

Molecular Biology of HCV 
Classification. Based on its goumie structure and virion properties, HCV has been 
classified as a aqxtrate g^nus in tiie flavivirus &mily, which includes two otiber genera: tiie 
35 flavivinises (e.g:, yellow fever (YF) virus) and tibe animal pestiviruses (eg., bovine viral 
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dianliea virus (BVDV) and classical swm ViroL 
SnppL 2» 223 (1991)]. All mmibm of Ibis &inily have enveloped virions that contain a 
positive-strand RNA genome encoding all knomi virus-specific proteins via translation of a 
single long open reading fiiame (ORF). 

S Structure and piQfsicalpropGiies of the virhn. Studies on the structure and pliysical 

properties of ^ HCY virion Imve been hampered by tbe lack of a cell ^ 
support efiScient virus replication and the typically low Utets of infectious virus piesent in 
serum. The size of infectious virus, based on fUtration experiments, is between 
[Bradley et al. Gastroenterology 88, 773-779 (1985); EcetoL, J. InfecL Dis. 156, 636-640 

10 (1987); Yuasa et al., J. Gen. ViroL 72, 2021-2024 (1991)]. Initial measunanents of tiie 

buoyant density of infectious material in sucrose yidded a range of values, with tiie majority 
presentinalowdensitypoolof<l.lg^ptradleyerdL, J.MedViroL 34,206-208 
(1991)]. Subsequent studies have used RT/PCR to detect HCV-specificRl^ 
measure of potentially infectious virus present in sera fixmi cbronicany infected humans or 

IS esqpenmentaUy infected chinopanzees. From these studies, it has become increasingly clear 
that considerable heterogeneity exists between difierrat clinical samples, and that many 
&ctors can affect the behavior of particles containing HCV RNA [Hijikata et oL, J, Virol 67, 
1953-1958 (1993); Thomssene/aiL,M«/.Mcra6/o/.//w^^ 181,293-300(1992)]. Such 
fectOTS include association with immunoglobulins [Hijikata et al, (1993) suprd\ or low 

20 density lipoprotein (Thomssen et al, 1992, supra\ Thomssen et al, Med. Microbiol 

Immunol 182,329-334(1993)1. In hi^ity infectious acute phase chimpanzee smnn, HCV- 
specific RNA is usually detected in fractions of low buoyant density (1.03-1.1 gAnI) [Carrick 
et al, J. Virol Metk 39, 279-289 (1992); Hijikata et al„ (1993) supra]. In odier san:q}les, the 
presence of HCV antibodies and formation of immune complexes correlate with particles of 

25 higher den^ty and lower infectivity [jffijikata et al, (1993) siq)ra]. Treatmoif of particles 
with chloroform, which destroys infectivity [Bradley et al., J. Infect Dis. 148, 254-265 
(1983); Feinstcme et al, IftfecL Immun. 41, 816-821 (1983)], or witii nonionic detergents, ' 
produced RNA containing particles of higher density (1.17-1.25 gAnl) believed to represent 
HCV nucleoc^ds [Hijikata et al, (1993) supra; Kanto et al., Hepatology 19, 296-302 

30 (1994); Miyamoto et al, J. Gen ViroL 73,715-718 (1992)]. 

There have been reports of negative-sense HCV-specific RNAs in sera and plasma 
[seeFongero/L, Journal of Clinical Imesdgadon 88:1058-60(1991)]. Howevo:, it seems 
unlikely that such RNAs are essential conqxments of infectious particles since some sera with 
high infectivity can have low or undetectable levels of negative-strand RNA [Shimizu et aL, 

35 Proc. Natl Acad. ScLUSA 90: 6037-6041 (1993)]. 
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The virion protein oonqiosition has not heen rigorously detennined, but HCV 
stnictural proteins include a basic C protein and two membrane glycoproteins, E 1 and E2. 

HCVreplicatioiL Early events in HCV replication are poorly understood. A 
hepatocyte recqitor may be CD81, i;^ch binds the E2 envelope glycoprotein (Pelm et aL, 
5 1998, Science 222:938-41). The association of some HCV particles with beta-Upoprotein and 
immunoglobulins raises &e possibilily tiiat these host molecules may modulate virus uptake 
and tissue tropism. 

Studies examining HCV rq)hcation have been largely restricted to hunumpat^ or 
experimentally inoculated chimpanzees. In the chinq>anzee model, HCV RNA is detected in 

10 the serum as early as three days post-inoculation and pmists through the peak of serum 
alanine aminotransfoase (ALT) levels (an indicator of liver damage) [Shimizu et al.. Proa 
NaiL Acad ScL USA 87: 6441-6444 (1990)]. The onset of viremia is followed by the 
qipearanceofindirecthaUniarksofHCV infection of the liver. These include the appearance 
of a cytoplasmic antigen [Shimizu et oL, (1990) stg>rd\ and ultrastmctural changes in 

15 hepatocytes such as the formation of microtubular aggregates for which HCV previously was 
refened to as the chloroform-sensitive "tubule forming agent" or "TFA" [reviewed by 
Bradley, Prog. Med. Virol 37: 101-135 (1990)]. As shown by the appearance of viral 
antigens [BUght et al.. Amer. J. Path 143: 1568-1573 (1993); Hiramatsu et ai, Hepatology 
16: 306-311 (1992); Krawczynski ai. Gastroenterology 103: 622-629 (1992); Yamada et 

20 al., Digest Dis. ScL 38: 882-887 (1993)] and the detection of positive and negative sense 
RNAs [Fong et at, (1991) supra; Gunji et al.. Arch ViroL 134: 293-302 (1994); Haruna et 
al., J, Hepatol 18: 96-100 (1993); lamas a/., J. flgifl/oi 16: 219-223 (1992); Noun Aria 
et al.. J. ain. Jnves. 91: 2226-34 (1993); Sheiker et ai.. J. Med. Virol 39: 91-96 (1993); 
TBkehanietaI.,Hq>aioiogy 15: 387-390 (1992); Tanaka a/., Idyer 13:203-208(1993)], 

25 hepatocytes appear to be a maj<»' site of HCV rqilication, particularly during acute infection 
[Negroes a2.. Proa Natl Acad ScL USA 89:2247-2251(1992)]. In lat^ stages of HCV 
infection the qspeanmce of HCV-specific antibodies, the posistenoe or resolution of viremia, 
and the severity of liver disease, vaiy greatly both in flie chimpanzee model and in human 
patioits (Farming etaL,5i<pfa). Altibough some liver damage may occur as a direct 

30 consequence of HCV infection and cytopalhogenicity, flie emergrng consensus is that host 
tnnnune responses, in particular virus-qi^ecific cytotoxic T lymphocytes, may play a more 
dominant role in mediating cellular damage. 

It has been speculated Oat HCV inay also rqpUcate in extra-hepatic reservoir(8). In 
some cases, RT/PCR or in situ hybridization has shown an association of HCV SNA wifli 

35 peripheral blood mononuclear cells including T-cells, B-cells» and monocytes [reviewed in 
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Blight and Go^vans, Viral Hepatitis Rev. 1: 143-155 (1995)]. Such tissue tropism could be 
relevant to the establishment of chronic infections and might also play a role in flie 
association between HCV inJfection and certain immunol<^cal abnonnalities such as mixed 
cryoglobulinemia [reviewed by Ferri et al„ Eur, J. Qin. Invest 23: 399*405 (1993)], 

5 gilomerulonqphritis, and rare non-Hodgkm's B-lymphomas [Ferri et al, (1993) suprar^ Kagawa 
et al.. Lancet 341: 3 16-317 (1993)]. However, the detection of circulating negative strand 
KNA in serum, flie difBculty m obtaining truly strand-specific RT/PCR [Gunji et ai., (1994) 
stqfra] , and tfie low numbers of apparently infected cells have made it difiicult to obtain 
unambiguous evidence for Tq)lication in these tissues in vivo. 

10 Genome structure. FuH^eagdi or nearly fuU4engtfa genome sequences of numerous 

HCV isolates have been reported [see, eg., Usietail.,J. Virol 68: 5063-5073 (1994a); 
Okamoto et al., J. Gen. Virol 75: 629-635 (1994); Sakamoto et al, J. Gen. Virol 7S: 
1761-1768 (1994); Trowbridge etal,>ircft Virol /«:501-511 (1998); Chamberlain etal,/. 
Gen- Virol 75:1341-1347 (1997); and citations within Davis, Am. J. Med. 27:21S-26S]. HCV 

15 genome RNAs are '^.6kilobases(kb) in length (Figure 1) and consist of a 5' nontranslated 
region (5 ^ NTR), a polyprotein coding region consisting of a single long open reading fiiame 
(ORF),anda3*NTR. The 5' NTR is 341-344 bases loiig and highly consaired. The length 
of the long ORF varies slightly among isolates, encoding polyprotetns of about 3010 to about 
3033 amino acids. 

20 The 3' KIR can be divided into ftree domains. The first (most 5*) domain shows 

considerable divmity both in composition and length (28-42 bases). Recent work by Yanagi 
et al. [Proc. Natl. Acad. Sci. USA 96:2291-2295(1999)] demonstrate that this region is not 
necessary for virus replication. The second domain is consists of a variable length 
polypyrimidine region of poly(A) (in at least HCV-1, type la [Han et al, Proc. Natl Acad. 

25 Sci. USA 88:1711-1715 (1991)]) or poly(U-UC) (see Chen et al, Kirology 188:102-113 
(1992); Okamoto et al., J. Gen. Virol 72:2697-2704 (1991); Tokita et al., J. Gen. ViroL 
66: 1476-83 (1994)]. The Ihird domain, at the «treme 3* end of the genome, is a highly 
conserved, novel KNA element of about 98 nucleotides, which is necessary for ^cient 
initiation of viral RNA rq>lication [see, e.g., U.S. Patent No. 5,874,565 and.U.S. Patent 

30 Application No. 08/811,566 (NowU.S. Patent No. ); Kolykhalov et al., J. Virol 70: 

3363-3371 (1996); Tanaka et al, Biochem. Biophys. Res. Comm. 215: 744-749 (1996); 
Tanaka et al, J. Virol 70:3307-12 (1996); Yamada et al. Virology 223:255-261 (1996); 
C3iengetal.J: Virol 73:7044-7049]. This domain and flie polypyriinidine regi<nis qypear to 
be critical for infectivity in vhv [Yanagi et al.. Proa Natl. Acad. Sd. USA 9<f:2291-2295 

35 (1999)]. 
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Translaiwn and proteofyHc processing. The conserved 5' NTRsequoice 
contains miiltiple short AUG-initiated ORFs and shows significant homology witii Oor S' NTR 
region of pestivinises [BuKh et aL, Proa Natl Acad. Sci, USA 89: 4942-4946 (1992); Han et 
al., (1991) sign-a}. A s^es of stem-loop structures that interact wi& host &ctors are present 

S These structures interact with host &ctois to initiate polyprotein synthesis through an internal 
ribosome catry site (IRES) allowing efficient translation initiation at the first AUG of Ihe long 
ORF [Honda etal.,/. Ww/ 73:4941-4951 (1999); Tang etal.,/, ViroL 75:2359-2364(1999); 
Psaridi et al., FEBSLetL 453:49-53 (1999)]. Some of fte predicted features of &e HCV and 
pestivirus IRES elements are shnilar to one anofter [Brown era/., (1992) signn}. Theability 

10 of this element to function as an IRES suggests fliat HCV genome RNAs may lack a 5' cap 
structure. 

The OTganizatiQn and processing of flie HCV iwlyprotein ^Figure 1) qipears to be 
most similar to ttiat of the pestiviruses. At least 10 polypeptides have been identified and the 
order of these cleavage products in die polyprotein is NH2-C-El-E2p-p7-NS2-NS3-NS4A- 

15 NS4B-NS5A-NS5B-COOH. As shown in Figure 1, proteolytic processing is mediated by 
host signal peptidase and two HCV-encoded protemases, die NS2-3 autoproteniase and the 
NS3-4A serine proteinase [see Rice, In "Fields Virology" (B. N. Fields, D. M. Knipe and P. 
M. Howley, Eds.), Vol. pp. 931-960. Raven Press, New York (1996); Shimotohno et oL, J. 
Hepatol 22: 87-92 (1995) for reviews]. C is a basic protein that serves as the viral core or 

20 capsid protein; El and E2 are virion envelope glycoproteins; p7 is a hydrophobic protein of 
unknown function lhat is inefGciently cleaved fixnn the E2 glycoprotein [Lin et al., (1994a) 
st^ra; Mizushima etal, X Virol 68: 6215-6222 (1994); SeSbyetal,, Virology 204: 114-122 
(1994)]. NS2-NS5B are nonstructural (NS) proteins which function in viral RNA replication 
complexes. Their functions have been identified as follows: NS2 is a metall<^m>tease; NS3 is 

25 a protease/helicase that contains motifs charactedstic of KNA helicases and that has been 

shown to possess an KNA-stimulatedNTPase activity [Suzich et al., J, Virol 67, 6152-6158 
(1993)]; NS4A is a co-fector for NS3; NS4B is of unknown function; NS5A interacts with 
cellular &ctars to transcriptionally modulate cellular genes and promote cell growth [Ghosh et 
al.,7. Biol Chem. 275:7184-7188] and provide IFNa resistance; and NS5B isar^licasediat 

30 contains the GDD motif characteristic of the SNA-dependent SNA polymerases of other 
positive-stiand SNA vnuses. 

Virimassernbfy and release. This process has not hem examined directly, but die 
lack of complex glycans, the ER localization of expressed HCV glycoprotems p^iibuisson et 
al, J. Virol 68: 6147-6160 (1994); Ralston et al, X Virol 67: 6753-6761 (1993)] and the 

35 absence ofthese proteins on die cell sur&ce[Dubuissone/ a/., (1994) ^t(p7vr; Spaete aiLp 
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Virology 188: 819-830 (1992)] suggest diat initial virion moiphogenesis may occur by 
budding into intracellular vesicles. Thus &r, efQcient particle formation and release has not 
hem observed in transient repression assays, suggesting tiiat essential vital or host &ctors are 
absent or blocked. HCV virion formation and release may be inefBdent, since a substantial 

5 fiactionof&e virus remains cell-associated, as found for die pestivinises. ETcteacellularHCV 
particles partially purified fiom human plasma contain complete N-linked glycans, althougjh 
diese carbohydrate moieties were not shown to be specifically associated with BlocBl [Sato 
ei al. Virology 196: 354-357 (1993)]. Complex gjtycans associated with glycoproteins on 
released virions would suggest transit tbroug^ the trans-Golgi and movement of virions 

10 through the host secretoiypatfawar/. Ifthisiscon«^intracdluIarsequestrati(mofHC^^ 
grlycoproteins and virion finmation might then play a role in flie estabhshment of chronic 
infectiom by niinimizing immune surveillance ^ lysis of virus-infected cells via 

antibody and complement 

Gen^ic variability. As for all positive-strand KNA. viruses, die KNA-dqpendent 

IS SNA polymerase of HCV (NSSB) is believed to lack a 3-5' exonuclease proofreading 

activity for removal of misincoiporated bases. Replication is dierefore etror-prone, leading to 
a "quasi-species" virus pqiulation consisting of a large number of variants [Martell et al,, J. 
ViroL 66: 3225-3229 (1992); Martell a/., J. Virol 68:3425-3436(1994)]. IWs variability 
is apparent at multiple levels. First, in a chronically infected individual, changes in the virus 

20 population occur over time [Ogata et oL, (1991) supra; Okamoto et al.. Virology 190: 
894-899 (1992)]; and these change may have inq)ortant consequences for disease. A 
particiilarly interesting example is die N-tenninal 30 residue segment of the £2 glycoprotein, 
which exhibits a much higher degree of variability dian the rest of the polyprotem [for 
examples, see Higadri et oL, Virology 197, 659-668. 1993; Bfijikata et aL, (1991) supra\ 

25 Weiner et al, (1991) ^upm]. There is accumulating evidence that tibis hypervariable region, 
called hypovariable region 1 (HVRl), podiaps analogous to die V3 domain of HIV-1 gpl20, 
may be under immune selection by drculating HCV-specifio antibodies [Kato et al,, (1993) 
m;?nz; Taniguchi ^ al. Virology 195: 297-301 (1993); Werner et al, (1992) s^pra. hi this 
model, antibodies directed against this portion of E2 may contribute to virus 

30 and thus drive the selection ofvarianfcswidi substitutions that penn^ 

neutralization. This plastidty suggests that a specific amino acid sequence in the E2 
hypervariable region is not essential for otiier fimctiims of pt>tein such as virion 
attachment, penetration, or assembly. Goietic evolution ofHVRlwidiin the first 4 montiis of 
infection has bem conslated widi the ability of a particular strain of the vinis to c^ 

35 infection CFaici et al.. Science 2^:339-344 (2000)]. 
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Genetic variability may also contribute to the spectrum of dijBTerent responses 
observed after IFN<KX treatment of cfaromcallyMectedi^ Diminished serum ALT 
levels and improved liver histology, which usually oonelates with a decrease in the level of 
circulating HCVRNA, is sem in '--40% of those treated [Ore^ Gen. ViroL 

S 72: 2015-2019 (1991)]. Afto: treatmmt, qiproximately 70% of die respondos relapse. In 
some cases, aft^ a transient loss of circulating viral KNA« renewed viremia is observed 
during or after die course of treatment While dus might suggest the existence or generation 
oflFN-HresistantHCY genotypes or variants^furdier work is needed to determine die relative 
contributions of virus genotype and host-specific differences in immune response. 

10 Sequence comparisons of different HCV isolates around die world have also revealed 

enormous graetic diversity [reviewed in Bukh et oL, (1995) supra]. Because of the lack of 
biologically relevant serological assays such as cross-neutralization tests, HCV types 
(designated by numbersX subtypes (designated by letters), and isolates are currently groiqied 
on die basis of nucleotide or amino acid sequence similarity. Worldwide, HCV has been 

15 classified into six major genotypes and more than 50 subtypes [Purcell, Hepatology 2(5:1 IS- 
14S (1997)]. Those of greatest importance in die U.S. are genotype 1, subtypes la and lb 
(see below and Buldi et oL, (1995) st^ra for a discussion of genotype prevalence and 
distribution). Amino acid sequence similarity between the most div^gent genotypes can be a 
litde as -50%, depending upon die protein being compared. This diversity has important 

20 biological implications, particularly for diagnosis, vaccine design, and therapy. 

HCV UNA replication. Byanalogy with odx^flaviviruses^repUcation of the positive- 
sense HCV virion RNA. is diougbt to occur via a ininus^trand intermediate. This strategy can 
be described briefly as follows: Ci) uncoatiqg of ttie incoming virus particle.releases the 
genonuc plus-strand, vMch is tnmslflted to piroduce a single long polyprotein that is probably 
'25 processed co- and post-translati(Hially to produce individual stmctural and nonstructural 
protein^ (ii) the nmstructoral proteins f oim a replication complex that utilizes die virion 
KNA as template for the synthesis of nunus strai^ (lii) these minus strands in turn serve as 
tenq)latBS for ^thesis of phis strands, wfaidi can be used for additional translation of viral 
pnitein, minus strand synlfaesis» or packagmgmto Very few details about 

3 0 HCV rqilication process are available, due to the lack of a good experimental system for virus 
propagatioiL Detailed analyses of audientic HCV rqplication and odier steps m the viral life 
cycle wouldbe gready fiidfitatedby the development of an efficient system far HCV 
rq>lication in cell culture. 

Mary attempts have been made to infect cultured cells with serum collected fiom 

35 HCV-infected individuals, and low levels of replication have been rqxirted in a number of 
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cells types infected by this method, including B-ceU [Bertolini et oL, Res. Virol 144: 
281-285 (1993); Nakajima a/., y. ViroL 70: 9925-9 Q996);yametaL, Res. VtrvL 146:2S5- 
288 (1995)], T-cell (Kato et al., Biochem. Biopkys. Res. Commtm. 20tf:863-9 (1996); 
Mizutani et al, Biochem. Biophys. Res. Comm. 227:822-826; Mi2xitani et al., J. VbroL 70: 

5 7219-7223 (1996); Nakajima ei oL, (1996) «(pra; Shimizu and Yoshikura, / Virol, 68: 8406- 
8408 (1994); Shimizu et al. Proa NatL Acad. Sd USA. 89: 5477-5481 (1992); Shimizu et aL, 
Proa Nail. Acad. ScL USA, 90: 6037-6041 (1993)1 and hq>atocyte [Kato et aL, Jpru J. 
Cancer Res., 87: 787-92 (1996); ragomi, /. Gastoetiterol andHepatoL. JO: 523-527 (1995)] 
ceU lines, as weU as peripheral blood monocular cells (PBMC^^ Virol, 

10 7d: 2485-2491 (1995)], and primaiy cultures of human fetal h^toc^^ 

Virol Siqypl 8: 31-39 (1993X CnTrier ct al., (1995) siq}ra; lacovacd et al.. Res. Virol, 144: 
275-279 (1993)] or hq>atocytes fiom adult chnnpanzees [Lanford et al.. Virology 202: 606-14 
(1994)]. HCV Tq>Ucation has also been detected in primary hepatocytes derived from a 
human HCV patient that were infected with flie virus in vivo prior to cultivation [Ito et al., J. 

15 Gen. Virol 77: 1043-1054 (1996)] and in the human hepatoma cell line Huh7 following 

transfection with RNA. transcribed in vitro from an HCV-1 cDNA clone [Yoo et al., /. Virol, 
69: 32-38 (1995)]. The repcHted observation of replication in cells transfected with KNA 
derived from tiie HCV-1 clone was puzzling, since Qiis clone lacks die required tominal 
3'NTR sequCTce downstream of die homopolymer tract (see below), and because a number of 

20 unusual observations were r^rted (see the background section of U.S. Patent Application 

No. 08/811,566 (Now U.S. Patmt No. )). The most well-characterized cell-culture 

systems for HCV replication utilize a B-cell line (Daudi) or T-cell lines persistoidy infected 
with retroviruses (HPB-Ma or MT-2) [Kato et al., (1995) «ipra; Mizutani et al., Biochem 
Biophys Res. Comm.^ 227: 822-826 (1996a); Mizutani et al., (1996) stspra; Nakajima et al., 

25 (1996) stqfra; Shimizu and Yoshikura, (1994) supra]; Shunizu, Proc. Natl. Acai Sd. USA, 
90: 6037-6041 (1993)]. HPBMa is infiDcted wifli an amphotropic murine leukemia virus 
pseudotype of murine sarcoma virus, ^le MT-2 is infected with human T-ceU lynqihotropic 
virus type I (HTLV-I). Qones(HPBMalO-2 and MT-2Q that support HCV rq>lication more 
efiSciently than the uncloned population have been isolated for the two T-cell lines HPBMa 

30 and MT-2 [Mizutani et aL J. Virol (1996) stq}ra; Shhnizu et al., (1993) stq>ra]. However, the 
TiMYiTTnim levels of RNA rq>lication obtained in &ese lines or in the Daudi lines after 
degradation of flie input SNA is still cmly about 5x10^ RNA molecules per 1 0^ cells 
(Mizutani et aL, (1996) stgmi; MSzutani et al., (1996) siqrra] or 10^ RNA molecules per ml of 
culture medium (Nakajima et aL, (1996) sign'a]. Alttiough the level of replication is low, 

35 long-term infections of ijq) to 198 days in one system [Nfizutani et al., Biochem. Biophys. Res. 



wo 01/89364 PCTAJS01/ld822 

10 

ComnL 227: 822-826 (1996a)] and more liian a year in anotber syston [Nakajima et al., 
(1996) sig>rd\ have been documented, and infectious virus production has hem demonstrated 
by serial cell-free or cell-mediated passage of the virus to naive cells. 

However, efBcient replication of an HCV clone comprising the essential conserved 

S terminal 3' NTR sequence had not been observed until the work described in co-pending 

application 08/81 1 ,566, now U.S, Patent No. , also rqxnted in KolyUbalov et aL, 

Science 277:570 (1997), which describes an infectious clone of an isolate of the H strain (lype 
la). HCV clones of other subtypes are now known. See, e.g., Yanagi et al., Vbvlogy 
2d[2:250-263 (1999) and YanagietaL, Virology 244:161-172(19961 While SNA transcripts 

10 of fliese clones are able to infect chimpanzees^ cell cultures with ttiese clones only support 
replication of die viruspoorfy if at all. 

As desoibed in U.S. Patent AppUcation No. 08/81 1,566 (NTow U.S. Patent No. ) 

(see, e.g.. Figure 2 Iherem) many variations of a fimcticHial clone aie possible. These include 
fidl length or partial sequences a foreign gene is inserted The foreign gene can 

1 S include, e.g., a reporter g^e such as P--galactosidase or luciferase, or a gene encodmg a 

selectable marker such as n^,Z)i2F7{, or A. In a specific ^Kaiiq>le disclosed therein, fhe Tieo 
gene is operably linked to an internal ribosome entry site (IRES), in order for infected cells to 
be selected by neomydn or G418 resistance. In this way, presence of replicating HCV BNA 
in essentially all surviving cells is assured. Additionally, the HCV polyprotein coding region 

20 of these clones can be deficient in some or all of the structural genes C, El and . E2. Thus, 

replicons can be created without the production of virions. By combining die structural gene- 
deficient constmct with a selectable marker such as neo, an efficiently replicating replicon 
system can be created diat can be used to study HCV replication and for other purposes. . 

Examples of the rq)licons disclosed in U.S. Patent Application No. 08/81 1,566 (Now 

25 U.S. Patent No. ) is provided in Lohmann et al.. Science 285: 1 10-1 13 (1999). In that 

work, DNA clones of HCV replicons of genotype 1, subtype lb were constructed. Features 
of those replicons diat are not wild-type HCV features are: a polyprotein coding region 
lacking die genes encoding the HCV stnictoralproteiiis; an EMCV IRES immediately 5* to 
the polyprotein r^cm; and a neo gene immediately 3 ' to the 5' NTR. (and die HCV IRES) , 

30 where die S* end ofihe HCV Cprotem gene is fused to die 5' end offhe/ieQ gene. Vfhsn 
Huhr7 cells woe transfected mttiRNA transcripts of these clones, 6 to >60 G418-rBsi8tant 
colonies arose per esqieriment Aldiough die numlwofcells treated was not specified, about 
10^- 10^ cells axe normally treated in experiments of diis type. Therefore, it is believed diat 
the trans&ction dGEciency, as measured by G418-resistant colonies/total treated, was less dian 

35 .01% in those studies. 
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Controls in tbe Lobnuum et al. woik included in-frame deletions of the active site of 
tfaeNSSB polymeiase. Although care was taken to remove template DNA fitmi the control 
transcripts, several G4l8-resistant control colonies arose. Still, the number of 0418-resistant 
control colonies that arose was much less than the colonies arising from the cells transfected 

S with the replicons containing the wild-type NSSB. 

When HbR G418-resistant colonies were subpassag^ most could not be maintained 
Out of more than 303 G418Hnesistant colonies from nochcontrol rq>licon treatments, 9 (<3%) 
could be subpassaged to establish stable cell lines. Replicons established in infected cell lines 
were sequenced. AlAoug^ each replicon had a number ofamino add substitution 

10 substitutions were scattered fliroughout the polyprotein coding region. Tlierefore, thae were 
no mutations that were consistenily in one area of the polyprotein coding region, and it was 
concluded Oat tte establishment of tibe nine cell lines was not due to ^dasptim mutations in 
those replicons. This ccmtention was expeiimentally tested by transfection/reconstitution 
experiments that did not provide evidence for adaptive changes. 

1 5 Despite the advances desaibed above, more efiBdent HCV-infected cell systems are 

needed for the production of concoitrated virus stocks, structural analysis of virion 
components, evaluation of putative antiviral therapies including vaccines .and antiviral 
compoimds, and improved analyses of intracellular viral processes, including SNA . 
replication. Thus, thwe is a need for various types of HCV clones that can be used for any of 

20 the above purposes. There is also a need to characterize HCV with respect to regions of the 
genome diat n:iight contribute to more efSdent in vitro or in vivo r^lication and virion 
production. 

Simmiary of the InventicHi 
25 Thus, a priniary object oflhie present invention has been to provide DNA en . 

non-naturally occurring HCV tfiat is dupable of rq>lication. 

Arelated object of die invenficm is to provide genomic KNA from the above DNA. 
Still another object of the invention is to provide attenuated HCV DNA or genomic KNA 
suitable for vacdne develqpmat, whidi can invade a cell and replicate but cannot propagate 
30 infectious virus. 

Another object of the invention is to provide in vitro and in vivo models of HCV 
infection and RNA rq>licatiQn fcM- testing anti-HCV (or antiviral) drugs, for evaluating drug 
resistance, and for testing attmuated HCV viral vacdnes. 
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An additional object of the invoition is to provide replicating HCV replicons. These 
Tq)licons do not encode structural proteins but nmy encode a foreign protein such as a 
reporter gene or a selectable marker. 

Still ano&er object of the invention is to provide adaptive replicons, wifii increased 

S ability to establish replication in continuous or primary cell lines. 

Briefly, dierefore, the invraitors have succeeded in discovering methods of creating 
replicating HCV variants, including variants widi adqitive mutations in HCV diat improve 
tbeir ability to establish RNA replication in culture to create continuous cell lines. These 
HCV variants and the cell lines that harbor Ifaem are useful for studying replication and other 

10 ' HCV characteristics. Ibe cell lines are also usefid for developing vaccines and for testing 
compounds fiir antiviral properties. 

Thus, in scmie onbodiments, the present invention is directed to a polynucleotide 
com pr is m g a non-natural^ occurring HCV sequence ftat is capable of productive replication 
in a host cell, or is capable of being transcribed into a non-naturally occurring HCV sequence 

IS that is capable ofproductiverqplication in a host cell. IbeHCV sequmce comprises, fipom 5' 
to 3' on tbe positive-sense nucleic acid, a fimctiaaal 5' noortranslated region (5' NTR); one or 
more protein coding rpgions, including at least one polyprotein coding region that is capable 
of r^licating HCV RNA; and a fimctional HCV 3' non-translated region (3' NIS). In 
preferred embodimmts of these polynucleotides, the 5' NIK. is an HCV 5' NTEt, flie 

20 polynucleotide comprises at least one IRES selected fiom the group consisting of a viral 

IRES, a ceUular IRES, and an artiiBcial IRES, and the polyprotein coding re^ . 
polyprotein coding region. 

In certain aspects of these embodiments, the above polynucleotides fur&er conqnise 
an adaptive mutation. The adaptive mutati<m can be such ^t the polynucleotide has a 

25 transfection efficiency into mammalian cells of greater than 0.01%; more pref^ably greats 
than 0.1%; even more preferably, greater dian 1%; still more preferably greater &an 5%, may 
be about 6%. The adaptive mutations can be such lhat the polynucleotide is capable of 
replication in a non-hqmtic cell, for exanq>le HeLa cells. The adaptive mutations can also 
cause the polynucleotide to have att^uated virulence, wherein the HCV is impaired in its 

30 ability to cause disease, establish chronic infections, trigger autoimmune responses, and 
transform cells. 

In some embodiments of the above described adsrptive mutants, the polyprotein 
rpgion comprises an NSSA gene ttiat is not a wfld-typeNSS A gene. Preferably, the NS5 A 
gene comprises a mutation. Hie mutation is preferably within SO imcleotides of an ISDR or 
3 5 includes the ISDR; more preferably fbo mutati9on is widun 20 nt of the ISDR, or includes the 
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ISDR. Exainplesofftese adaptive mutations are those fhat encoder 
change selected fiom the groiq> consisting of Ser (1 179) to He, Arg (1 164) to Gly, Ala(l 174) 
to Ser, Ser(l 172) to Cys, and Ser(l 172) to Pro of SEQ ID N0:3. Other adaptive mutations 
include a deletion ofat least a portion ofthe ISDR, and may compise the entte Ina 
S particular embodiment, the adaptive mutation comprises a deletion of nucleotides S34S to 
5485ofSEQIDNO:6. 

hi some embodimmts of tiie invoition polynucleotides, the HCV polyprotein coding 
region mcodes all HCVstnictural and nonstructural piotein& In othor embodiments, the 
polyprotein coding region is incapable of malting infectious HCV particles, making the HCV 

10 variant a rq)Ucon. Preferably the inability to make HCV particles is due to a iteletion in 
stnictural protein coding region. SomembodimentsoftheseiepHconsfurtiierconqnisea 
foreign gene qperably Unked to a first IRES and fte HCV polyprotein coding region operably 
linked to a second IRES. Prefembly, the iq>Uconconqxrises a genotype 1 HCV sequence, 
most prefoably subtype lb. Preferred foreign genes in tibese replicons are selectable markers 

15 or reporter genes. In other preferred replicon embodimaits, the first IRES is an HCV IRES, 
thef(n:eigngaieisaneagene,andthesecondIRESisa£MCVIR£S. Examples of the 
above replicons include SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:22 and SEQ ID NO:25. 
The above replicons also preferably comprise an ads^tive mutation, including any of the 
adq>tive phenotypes jireviously described, including increased transfection efficiency, 

20 replication in a non-hepatic cell including HeLa cells, and attenuated virulence, and further 
comprising any of the ad£qptive mutations previously described, such as the various NSSA 
mutations and deletions previously desoibed. 

The polynucleotides of the present invaition can be in the form of RNA or DNA. 
Preferred embodiments of the polynucleotides are SEQ ID NOs:5-13 and 22-25, the 

25 complements tiiem>i^ and the RNA equivalents of tiie sequences or their coni^ hi 
certain CTbodim^ats, the polynucleotides are cqmble of productive infection in a chinqjanzee 
iqxm intiahqiatic injection. 

The present invention is also directed to esqpiession vectors oonoprismg DNA forms of 
any of the above polynucleotides, opmibly associated with a promoter. Additionally, the 

30 invration is directed to cells comprising the above expression vectors as well as host cells 
comprising any of the polynucleotides described above. The host cells are preferably 
mammalian cells, more preferably human cells. The host cells are prefaably he{^tocytes, T- 
cells,B-^lls,orfi)resldnfayrohlast^mostpr^ Certain adaptive mutants 

can also replicate in HeLa cells. The host ceUs can he within a non-human mammal cqiable 
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of supporting tiansfection and replication of Hie HCV KNA, and infection yAim Hie HCV 
KNA encodes a virus particle. A prefened non-human mammal is a chimpanzee. 

In additional embodimoits, Ifae presoit invention is directed to methods for 
identifying a cell Une that is permissive for RNA replication with HCV. The method includes 
S die steps of contacting a cell in tissue culture with an infectious amount of the above- 
described polynucleotides, and detecting rq)lication of HCV variants in cells of die cell line. 

Ihe present invention is also directed to a mediod for producing a cell line 
comprising rq)licating HCV. The method includes die steps of (a) transmhing die above- 
described expression vector to syndiesize HCV SNA; (b) transfecting a cell widi the HCV 
10 RNA; and (c)culturing die cell. 

Additionally, die present invention is directed to a vaccine. The vaccine includes any 
of the above-described polynucleotides, in a pharmaceuticaltir.acccplable carrier. In related 
embodiments, the presmt invention is directed to a method of inducmg immunoprotecti<m to 
HCV in a primate. The method includes administering the vaccine to die primate. 
15 In further embodiments, the present invention is directed to a method of testing a 

compound for inhibiting HCV replication. Ihe mediod includes the steps of (a) treating the 
above described host cells with the compound; and (b) evaluating the treated host cell for 
. reduced replication, wherein reduced HCV replication indicates the ability of die compound 
to inhibit replication. 

20 In additional embodiments, die present invention is directed to a method of testing a 

compound for inhibiting HCV infection. The mediod comprises treating a host cell with the 
compound before, during or after infecting the host cell with any of the invention 
polynucleotides. 

In still odier embodiments, die present invention is directed to an HCV variant diat 
25 has (a) transfection eflScimcy greater dian 0.01%, as determined by replication-dependent 
neomycin resistance, or (b) greater ability of initial colonies of cells Iransfected with die 
variant to survive subpassag^ than wildrtype HCV gwo1ypel,subty^ The HCV variant 
also has, fixmi 5' to 3' on die positive-sense nucleic acid, a functional HCV 5' non-transkted 
region (5'NTR) conqirismg an extreme 5 -tenninal conserved sequence; an HCVpolyprotein 
30 coding region; and a functional HCV 3' non-translated region (3'NTR) compnsmg a variable 
region, a polypyrimidine region, and an extreme 3 -teraund conserved^ Inprefened 
embodiments, the transfection dOKdency is greater dian 0. 1 % ; in mote prefened 
embodiments, greater than 1%; in still more p r efe rred embodiments, greatordian 5%. lathe 
most preferred embodiments, die transfection elBBcimcy is about 6%. 
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Hie variants can have any of fhe characteristics of the polynucleotides desmbed 
above. However, prefmed variants comprise tbe NSS A mutation or deleti^ 
fhe polynucleotides above. 

Anumg the several advantages achieved by tiie present invention are the provision of 
S polynucleotides comprising non-naturaUy occurring HCV sequences; the provision of HCV 
variants that have a transfection efficiency and ability to survive subpassage greater than 
HCV. forms that have wild4ype polyirotein coding regions; the provision of oqiression 
vectors comprising the above polynucleotides and HCV variants; the provision of cells and 
host cells conqmsing tiie above expression vectors, the provision of mettiods for identifymg a 
10 cell line that is pmnissive for KNA rq>lication with HCV; the provision of vaccines 

c(mqnising the above polynucleotides in a phannacoitically accq>table carrier; tiie provision 
of methods for inducing immurKqmytection to HCV in a primate; and tiie provision of 
methods for testing a compound for inhibiting HCV replication. 

15 Brief Descaiption of flie Drawings 

FIGURE 1. HCV genome structure, polyprotein processing, and protein features. Atthetop 
is depicted the viral genome with the structural and nonstructural protein coding regions, and 
tiie S'and 3' MTRs, and the putative 3' secondary structure. Boxes below the goiome indicate 
proteins graerated by the proteolytic processing cascade. Putative structural proteins are 

20 indicated by shaded boxes and the nonstructural proteins by open boxes. Contiguous 

stretches of uncharged amino acids are shown by black bars. Asterisks denote proteins with 
N-linked glycans but do not necessarily indicate tiie position or number of sites utilized. 
Geavage sites shown are for host signalase (4), the NS2-3 proteinase (curved anow)» an the 
NS3-4A serine protease (% 

25 

FIGURE 2. Strategies for ejg}ressionofheterologousRNAs and proteins mingHCT vectors^ 
At the top is a diagram of the positive-polarity RNA virus HCV, which expresses mature viral 
proteins by translation of a single long ORF and proteolytic processing. The regions of the 
polyprotein encoding the structural proteins (STRUCTURAL) and the nonstructural proteins 

30 (EtEPLICASE) are indicated as lightly-shaded and open boxes, respectively. Below are 
shownanumbarofpnqx)sedrepticatioiMxmq)ete&t*Veptic(m The 
first four constructs (A-D) lack structural genes and would therefore require a helper system 
to enable packagmg into infectious virions. Constructs B-G would not require helper 
functions for rq^lication or packaging. Darkty shaded boxes indicate heterologous or foreign 

35 gene sequences (FG). Translation initiation (aug) and termination signals (trm) are indicated 
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by open triangles and solid diamonds, respectively. Internal ribosomes entiy sites (IRES) aie 
shown as boxes with vertical stripes. Ccmstructs AandHiUustratethee^qpressionofa 
heterologous p]x>duct as an in-fimiefusi(m with AeHCVpoly^ Such protein fusion 
junctions can be engineered such that inocessing is mediated either by host or viral 
S proteinases (indicated by the arrow). 

FIGUSES. Structure of HCFreplbBarthfan. Two versions ofthis infectious rqilicon were 
ccmstructed as desoibed in Example 1. Thefirs^HCVreplbBartMan/AvaII,hasaj4van 
restriction site in the variable domain of &e 3' NTRdiat is not present in the 3'NTRof wild- 
10 type HCV subtype lb. The second variant, HCVreplbBartMan/A2irs, has 32, raflra* than the 
wild-type 34, IPs m tiie longest stretch of contiguous ITs in tibe polypyrimidine domain of the 
3' NPL The "GDI>-^AGG" designation shows the inactivating mutation in tiie non- 
leplicatmgiqpUcoiis that were used as polymerase-ininuscontrok in Ex^^ 1. 

IS HGURE4. Generation of G418'resistant cell dones. At the top is a diagram of the 

HCVreplbBartManieplicons as described in Figure 3. Hie middle text summarizes flie steps 
used to isolate the adaptive mutants, wMch are ftnlher described in Exai^ Tliebottom 
chart summarizes several characteristics of some of the replicons isolated as described in the 
Example. 

20 

FIGURES. Synthesis of HCV'SpedficRNA arid proteins. Figure 5A illustrates actinomydn 
D-resistant KNA replication of four adaptive rq)licons as further described in the Example. 
Figure SB illustrates die inmiuiu>precipitation of ^S-labeled HCV-specific proteins of three 
adaptive replicons as further described in Example 1. 

25 

FIGURE 6. Detection cfNS3 in G418-resistaracdl clones. Monolayers of cells transfected 
with various repUcons as indicated were inmnmostainedwidi an anti-NS3m^ Patterns 
of staining were similar to cells staiiied fiom an infected liver. 

30 FIGURE?, thudeotide and anmoaad changes in th^ 

Nucleotide and ainino acid chaiig^ in a portion of the NSS A coding region of seven adaptive 
clones are indicated. 

FIGURE 8. G418-resistant colonies generated after electroporation of r^licon SNAs into 
35 Huh7cells. The abiHtyofan adaptive r^Hcon(R£pUconQ to establish colonies af^ 
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transfection into Huh7 cellsXmiddle) is covapsaed to Hie origiiial iq)licon 
HCViepBartMaii/Avall (left) and the same adaptive replicon, but with an inactivating 
mutation in flie pofymoase gene (right). 

5 FIGURE 9. StmcturesofHCVrepliconsandfidl-lengihHCV The adaptive replicon 
S'NTR-EMCV has the S'NTR fused directly to the EMCV IRES iq)stieam of NS3. Another 
ad^ve replicon, HCViep/NS2-5B has flie noo-structural protein, NS2, iqpstream of NS3. A 
fuI14ength HCV cDNA clmie, HCV FL» was assembled. Also, a bidstronic derivative, HCV 
FLr-neo, was assembled where the S'NTR is fused to the neomycin phosphotransferase gene 

10 and the EMCV IRES is upstream oftfae HCV open reading fiame. In both fiiU-length clones, 
the open reading frame conqmses the structural and non-structural regions, from capsid to 
NSSB. In addition, all of the repliccHis and full-length HCV RNAs comprise the mutation 
coding for Ser to lie substitution at position 1 179 of SEQ ID NO:3, in NS5A. 

15 FIGURE 10. SNA replication ofreplicom and fiill-length HCV mAs. The HCV replicons . 
and full-length HCV RNAs shown in FIGURE 9 are r^lication competent 

Detailed Description of the ^vention 

Definitions 

20 Various terms are used herein, \^ch have the following definitions: 

As used herein, "HCV polyprotein coding region** means the portion of a hqiatitis C 
virus that codes for file polyprotein open reading frame (ORF). This ORF may encode 
proteins that are the same or different Aan wild-type HCV proteins. The OSFnoay also 
encode only some of the functional protems encoded by a wfld-type polyprotein coding 

25 region. The proteins encoded Oeran may also be fi:om different isolates of 
HCV proteins may also be encoded therein. 

Thephxase "pharmaceutically aocq)tabl6'' refers to molecular entities and 
compositions that are pliysiol(>gica% tolerable and do not typically produce an allergic or 
similar untoward reactim, such as gastric iqiset, dizdness and the like, when adnunistered to 

30 ahuman. Preferably* as used herehi, the term '*pharmace^ 

by a regulatory agency of the Federal or a state goi^imnent or listed in &e U.^^ 
Pharmacopoeia or other generally Tec<)gnized pharmacopoeia for use in animals* and more 
particularly in humans. Ibe tenn "carrier^ refers to a dihient, adjuvant, excipient, or vehicle 
with which the compomid is administered. Such pharmaceutical carriers can be sterile 

35 liquids, such as water and oils, including those of petroleum, animal, vegetable or synthetic 
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origin, such as peanut oil, soybefm oil, tnineral oil, se^^ Water or aqueous 

solution saline solutions and aqueous dextrose and glycerol solutions are preferably employed 
as carriers, particularly for injectable solutions. Suitable pharmaceutical carriers are 
described in **Remington's Pharmaceutical Sciences'* by RW. Martm. 

S The phrase 'therapeutically effective amounts is used herein to mean an amount 

sufELcient to reduce by at least about 1 5 peicent, preferably by at least SO percent more 
preferably by at least 90 percent, andmostpreferably prevent, a cliuically significant deficit 
in the activity, fimction and response of the host Alternatively, a therapeutically efifective 
amount is sufBcient to cause an improvement in a clinically significant condition in host 

10 The term ''adjuvant" refers to a conqx>und or mixture that enhances tte munune 

response to an antigen. An adjuvant can serve as a tissue depot that slowly releases the 
antigen and also as a lymphoid system activator that non-specifically oihances tiie linn 1 1 mc 
response (Hood et al.. Immunology, Second Ed^ 1984, Benjamin/Cmmnings: Motilo Park, 
California, p. 384). Often, a primary challenge with an antigen alone, in the absence of an 

15 adjuvant, wiU&il to eUdt a humoral or cellular immune response. Adjuvants include, but are 
not limited to, complete Freund's adjuvant, incomplete Freimd*s adjuvant, saponin, mineral 
gels such as aluminum hydroxide, sur&ce active substances such as lysolecitfain, pluronic 
polyols, polyanions, peptides, oil or hydrocarbon emulsions, keyhole limpet hemocyanins, 
dinitrophenol, and potentially useful human adjuvants such as BCXJ (bacille Calmette-Guerin) 

20 znACorynebacteriumparvum, Preferably, the adjuvant is pharmaceutically acceptable. 

In a specific embodiment, the term "about" or "approximately" means within 20%, 
preferably within 10%, and more preferably within 5% of a given value or range. 

The tenn "virus infectim" as used herein, refers to the usual way that wild-type virus 
particles become established in host cells. This generaUy includes Imidirig to &e host cell, 

25 uptake, delivery to &e cytosol or nucleus, and initiation of replication. 

The tera ''txansfection" as used heraii, refers to die iiifection of a cell with a 
polynucleotide. The polynucleotide can be DNA or RNA. A preferred method of 
transfecting a cdlwi& an HCV polynucleotide is wi&rq)UcationcQini^^ Delivery 
to permissive cells can be fecilitated by electrcqxxcation, charged liposomes, hig^ salt, DE 

30 dextran,etc. Rq>ticationcornpetentKNAs can also be launched in ceHs after traiisfecfim 
DNA such as plasmids or DNA viruses fliat have been sq^ffopriately eoginemd to inxivide 
transcription iiutiation and terminatian signals. ThetransfectedSNAscanre[Hesentfi]ll- 
length genome RNAs capable of initating a complete rq>lication cycle (including production 
of progeny virus), or tibey may be defective lacking one or more KNA elements or proteins 

35 essential for virion iHX)duction but not KKArephcati The latter RNAs, which are lacldiig 
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in the ability to produce a virion, will be refisrred to generally hei:ein as "replication competoit 
KNAs", "RNA replicons" or •^licons". 

As used berein, die tenn " subpassage" connotes the transfer of a colony from one 
vessel of media to another vessel of media. Examples of vessels of media include dishes, 
5 botdes or test tubes wifli solid liquid growth media. Unless otherwise indicated, 

"subpassage** means the transfer of a colony of HCV-tiansfected cells from a vessel of media 
where flie newly transfected cells were plated to a vessel of media where the colony is 
isolated. 

The tenn "auOientic" is used herein to refer to an HCV polynucleotide, wbe&er a 

1 0 DNA or RNA« that provides for replication mi production of functional HCV proteins, or 
components thereof Hie audientic HCV polynucleotides of the present invention are capable 
of replication and may be infectious, e.g. , in a chimpanzee model or in ti ssue culture, to form 
viral particles (i.e, "virions'*). An auflientic HCV pol^ucleotide of the present invention 
may also be a "replicon", such that it is incapable of producing the full complement of 

IS structural proteiiis to make a rq)Hcation competent infectious virion. However, such 
replicons are capable of RNA rq>lication. Thus, the authentic HCV polynucleotides 
exemplified in the present application contains all of the virus-encoded information, whether 
in RNA elements or encoded proteins, necessary for initiation of an HCV RNA replication 
cycle. The audientic HCV polynucleotides of the invention include modifications described 

20 herein, eg., by site-directed mutagenesis or by culture adaptation, producing a defective or 
attenuated derivative, or an adaptive variant Alternatively, sequences from other genotypes 
or isolates can be substituted for the homologous sequence of the specific embodiments 
described herein. For example, an authentic HCV nucleic acid of the invention may comprise 
flie adaptive mutations disclosed herein, e.g. , on a recipient plasmid, mgineered into tfie 

25 polyprotein coding region of a fimctional dtae from anofter isolate or genotype (either a 
consensus region or one obtained by veiyhigjh fidelity cloning). In addition, the HCV 
polynucleotide of the present invention can include a foreign gene, suchas agqae encodinga 
selectable naatter or a rep or te r protein. 

30 General Description 

The practice of the present invention will employ, unless othcamse indicated, 
conventional techniques of cell culture, molecular biology, microbiology, recombinant DNA, 
and iinmunology,whidi are within &e skill of tibte art Such techniques are explained fully m 
the litmture. See, e.g., Ausubel et al. (ed.) (1993) "Current protocols in molecular Uology. 

35 Green Publishing Associates, New Yoik; Ausubel et aL (1995), "Shcnrt Protocols in Molecular 
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Biology", John Wiley and Scms; Joseph Sambrook et d. (1989), "Molecular Qoniiig, A 
Laboratoiy Manual", second ed.. Cold Spring Haibor Laboratoiy Press; the series, 
METHODS IN ENZYMOLOGY (Academic Press, hic); Amnud Cell Culture [R.I. 
Freshney, ed. (1986)]; Lau. ed. (1999), HEPATITIS C PROTOCOLS, Humana Press. 
S New York; and Immobilized Cells And Enzymes PRL Press, (1986)]; all of which are 
incorporated by reference. 

The present invention is dkected to variants of hq)atitis C virus (HCV) and methods 
for producing the variants. Asusedherem,anHCVvariantisanon-naturally occni^^ 
sequence that is capable ofproductiverq>Ucatian in a lK>stceU^ The genetic sequence of 
10 fliese variants may comprise inserticms, deletions, or base mutations from wild type HCV 
sequences. As fiirdier discussed the variants may be product 
by methods known to ttie skilled artisan (see, e.g., U JS. Patent Application No. 0 8/8 1 1 ,5 66 
(Now U.S. Patent No . ) : Lohmann et al., Sdence 285:1 10-1 13(1999)). Alternatively, as 
further discussed i7|/hi, the variaiits ^ 
IS combination of culture selection and gonetic engineering. 

The variants are in tiie form of DNA or RNA and can be incorporated into any useful 
form of those compounds, for example in extracliromosomal DNA that replicates in a 
microorganism such as E. colt or yeast Licluded among these are plasmids, phage, B ACs, 
YACs, etc. RNA and virions comprising tiie variant are also envisioned as vnUm the scope 
20 of the invention. The variants of the present invention can also be in the form of cassettes for 
ins^on into a DNA cloning vector. The HCV RNAs are envisioned to be conoplementary to 
any HCV DNA disclosed herein. An infectious HCV RNA is a positive strand RNA seated 
fiom the negative strand template of the HCV DNA clone of the invention. 

The variants of &e present invention are not narrowly limited to any particular virus 
25 subtype. Thus, any particular component of die variant, or die entire varia^ 

any HCV subtype. Prefoied subtypes are la and lb, due to the widespread occurrence, as 
well as the large amount of knowledge available for those two subtypes. However, the use of 
any other genotype or subtype, as would be considered within the skill of die art, is 
envisioned as within the scqie of the invention. These subtypes include, but are not limited 
30 to, any subtypes wifliin genotypes HCV-1, HCV-2, HCV-3, HCV-4, HCV-5, and HCV^, 
Moreover, since HCV lacks inoofieadmg activity, ^ virus itself readily mutates, forming 
mutant " quasinqpedes" of HCV that are also ccntenqilated as useful for the present invention. 
Such mutations are easily identified by sequencing isolates fix>m a sub] ect, as detailed herein 

or in U.S. Patmt AnplicatimNo. 08/811,566 (Now U.S. Patent No. ). It would be 

3 5 expected that the methods and compositions disclosed herein are useful for any known 
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subtype or quasi-species, or wy subtype or quasi-species not now known but that is 
discovered in the future. 

The HCV variants of the invention include a S -NTR conserved sequence, which 
generally comprises the S'-terminal sequence GCC AGCC, and which may have additional 

S bases upstream of this conserved sequence wiftout affecting fimctional activity of the HCV 
nucleic acid, hi a preferred embodiment, die S*-GCCAGCX:; includes from 0 to about 10 
additional upstream bases; more prefmbly it includes from 0 to about S upstream bases; more 
preferably still it includes 0, one, or two iq)stream bases. In specific embodiments, flie 
extr^e S'-terminal sequence may be GCXIAGCX:; GGCCAGCX; UCKXAGCX!; 

10 AGCXJAGCC; AAGCCAGCC; GACKXAGCC; GUGCCAGCX:; or GCGCCAGC5C, wherein 
the sequence QCCAQOC is the S'-tmninus of SEQ ID NO:I. However, the scope of flie 
HCV variants of the invention encompasses any functional HCV 5' NTEt» whether now 
known or later discovered. 

The HCV variants of the invention also include a 3 * NIR that comprises a poly- 

15 pyrimidine region as is known in wild-type HCV. These polypyrimidine regions are known 
to comprise, on the positiv&*strand HCV RNA, a poly(U)^ly(UC) tract or a poly(A) tract 
However, the polypyrimidine region of the present invention may also include other 
polypyrimidine tracts that are not now known but are later found to be functional in infectious 
HCV. As is known in the art. Hie polypyrimidine tract may be of variable length: both short 

20 (about 75 bases) and long (133 bases) are effective, altiiough an HCV clone containing a long 
poly(U/UQ tract is found to be highly infectious. Longer tracts may be found in naturally 
^ occurring HCV isolates. Thus, an authmtic HCV nucleic acid of the invention may have a 
variable lengdi polypyrimidine trad 

The 3' N1R also coixqxrises, at its extrenie 3* endl, flie hig^ 

25 of about 98 nucleotides known in the art, and as described in, e.g., U.S. Patent No. 5,874,565, 

U.S. Patent Application No. 08/81 1,566 (Now IJ JS. Patent No. ), and U.S. Patmt No. 

5,837,463. In a specific aspect, die S'-NIResctrraneteronnus is KKAhomol^^ 
having the sequence 

5'.TXK3TGGCTOCATXriTAGOCCrAGlCACG(KTrAGC^ 
30 GCATXJACTXKlAGAGAGTGCTXSATACTXKKXnX^^ (SEQ ID 

N0:2). However, the scope offlxe invention is iiieant to enconqiass HCV variants m 
. HCV3'NTlLtiiataUowsvinisrephcation,wheaiertiiesequau^ 
discovered. Included are 3* NTKs that do not comprise a variable region. 

The HCV variants of the present invention also include a polyprotem coding region 
35 sufficient to allow replicati<moftiie HCV RNA; Thus, flie polyprotem coding region may be 
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deficient in functtonal genes mxxling fhe full complemeat of fhe HCV stnictural genes C, El 
andE2. In addition, fliepolyprotein coding region may coiiq^ 

mutations ftat do not occur in wUd-type HCV strains. Rirther.tibiepolyproteia coding region 
may be chimeric, such that some of ttie genes encoded Ifaarem are fixnn analogous regions of 

S another virus, as discussed ii;/hz. 

The HCV variants ^compassed by flie present invention include variants that do not 
produce virus particles. These variants, which may be tamed "replicons", lack fteabiUly to 
produce a fully functional conqilement of the structural Thetnability 
to produce the functional stractural protein component of the HCV vims may he confened by 

10 deletion ofthegeiies encoding dne, two, or aUtiiree of aesepn>ten^ Alternatively, a 
deletion of a small portion of tiie codiog sequence of one of the struchnal proteins, or a 
mutation in a critical region of Ae coding sequence, or an insertion into Ae coding sequence 
could lead to an HCV tiiat cannot produce virions, b the latter case» the msertion can be any 
sequence tiiat disrupts fhe ability of tiie stnictuial protein fixnn becoming part of a virion, and 

IS can include functional sequences, such as tiiose that encode a reporter gene (such as p- 

galactosidase) or those that confers selectability to the cell harboring die replicon (such as 
neo). The above mampulations are entirely witiiin the skill of the art See, e.g., Lohmann et 
al.,5x(praandExan[q)le 1. As discussed iij/hi, such variants are usefid for studying tqiUcati 
of fhe HCV virus, among oiha: things. 

20 The variants of tiie present invention can also comprise an alt^tion in die coding 

sequence of the polyprotein coding region tiiat does not affect the production of functional 
virions or replicons. These alterations can be such that tiieaiiiino add sequerice of the mature 
protein is not changed fixnn the wild-type sequence, due to tiie degeneracy of the genetic 
code. Such dteratioris can be usefld,e.g.,^ien they introduce or remove a restriction dte. 

Is such that the size of HCV fiagments produced by digestion witii a restriction enzyme is 
altered. Thisprovides a distinguishing characteristic of diatvariaoit,^^ 
to idmtify a particular infectious isolate in a multiple infection animal model, <^ to im>vide 
convenient sites for subsequent oiginearing. Any techrdque for mutagenesis known in die art 
can be used, including but not limited to in vitro site-directed mutagenesis (Hutchinson, C, ei 

30 a/., 1978, J. BioL ChmL 253:6551; Zoller and Smitii, 1984, DNA 3:479-488; Ol^hant et al,, 
1986, Gene 44:177; Hutchinson ei aL. 1986, Proc. Nati. Acad. Sci. U.S A. 83:710], use of 
TAB® linkers (Pharmacia), etc. PCR techniques are prefened for site directed mutagenesis 
[see Higuchi, 1989, "Usmg PCR to fiogineer DNA", in PCR Technology: Principles and 
Applications for DNA Anq^Ufication^ H. Blich, ed, Stockton Press, Chapt» 6, pp. 61-70]. 
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Alterations in the polyprotein coding sequence can also introduce c<Hiservative amino 
acid substitutions in the HCV-encoded proteins. Conservative amino acid substitutions refer 
to the intensbangeabiUty of residues having simi^ Conservatively substitated 

ainino acids can be gioiqped according to tfaecheoaicalimiperties of For 

5 example, one grouping of amino acids includes those amino acids have neutral and 

hydrophobic side chains (A, V, L, I, F, F, and US); anoQier grouping is those amino acids 
having neutral and polar side chains (G, S, T, Y, C, N, and Q); anofliCT grotq)ing is those 
amino acids having basic side chains (K» R, andH); anotibargroiq)ing is those amino acids 
having acidic side chains (D and E); ano&er grouping is fliose amino acids having aliphatic 

1 0 side chains (G, A, V, L, and I); another groupmg is those amino acids having aliphatic- 
hydroxyl side chains (S and T); ano&er grouping is tiiose amino adds having amines 
containing side chains (N» Q» K, R, and H); another grouping is those amino acids having 
aromatic side chains (F» Y, and W); and another grouping is those amino acids having sulfiir- 
containing side chains (C and M). Preferred conservative amino acid substitutions are: R-K; 

15 E-D, Y-F, L-M; V-I, and Q-H. QHisavative amino acid substitutioiis, when conferred on the 
structural proteins, can alter antigenic epitopes, and thus tiie immime reactivity of the virus. 
Those substitutions cotild also alter the function of the non-structural proteins, such that tiie 
virus reproduces at a differrat rate or is altered in its ability to replicate in cell culture or in an 
organism. See, e.g.. Example 1, where replicon IV is adaptive to cell culture conditions due 

20 to the conservative amino acid substitution Ser Cys in tiie NS5 A protein. 

Alterations in flie polyprotein coding region could also introduce nonconservative 
anuno acid substitutions in <»ne or more of the proteins eooc^^ Nonconservative 
substitutions would be eaqiected to alter protem fimction more drastically tiian conservative 
substitutions, and wodd tims be nKm likefy than conservative su^^ 

25 pbenotypic characteristics of the virus sucbas replication rate, adaptation to cell culture or in 
v7V!9cdture» and disq[)]ayed antigenic determinants. Bxanq>les are several additive mutations 
in the NSS A coding region desmbed in tiie , 

In some embodiments of the invention, tbie polyprotein coding regicm has a consensus 
sequeiK:e derived fiom more than one HCV isolate. For example, an auflienticHCV nucleic 

30 acid of ttie invention may comprise a S' and 3' sequence fiom any one subtype of the virus and 
a polyprotdn region fiom any other subtype. Altemativdy, only one oftiie proteins encoded 
in the polyprotein mif^ be fixym another viral subtype. .In this way, the efifect of a particular 
protein in conferring characteristics of a particular strain (e.g., reduced viruloice, increased 
replication rate etc.) can be studied 
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Chimms wi& oAer viruses, such as with bovine viial diarrhea virus, or another 
flavivinis, are also envisioned. See, e.g.,PCT/US99/088S0,incoqxirat6d herein by referee 
In these embodiments^ componaits of the functional clones can be used to construct chimeric 
viruses for assay of HCV gene functions and inhibitors fliereof [Filocamo et aL, J. Virol. 71: 

5 1417-1427 (1997); Hahm ei oL, Virology 226: 318-326 (1996); Lu and Wimm^r, Proc Nail 
Acad Sci USA 93: 1412-7 (1996)]. In one such extension of &e invention, fimctional HCV 
elements such as the S' IRES, jHOteases, RNA helicase, polymerase, or 3' NTR are used to 
create chimeric derivatives of BVDV whose im)ductive replication is dependent on one or 
mcmoffhese HCV elements. Such BVDV/HCV chimeras can then be used to screen for and 

10 evaluate antivirdstrat^es against tiiese fimctional components. 

Chimeras yi/bm a ^ne mcodixig a structural or nonstructural protein fiom a closely 
related vinis such as OB virus Brq>lacesfliecorresp(mding HCV gene would also be 
expected to be fimctionaL See, e.g.,Buddewiczetal., 2000, J. Fi>v/: 7^,4291-4301. 

IS Othar alterations in die polyprotein coding region contemplated b^ 

invention include deletions or insertions in the sequmce. Such alterations may also alter 
replication rate, adaptation to various growth conditions, or antigenic determinants. A 
preferred exan^le of a usefiil deletion includes the 47 amino acid deletion and replacement of 
Ser 1182 to Asp 1229 of SEQ ID NO:3 with Tyr, which is an adaptive mutation in the NS5A 

20 that provides greater transfectionefiSdency than HCVs with wild-type NSSA^ See Example 
1. 

Insertions into 11^ polyprotein coding region can be of any length and into any area of 
the region, provided tiie modified HCV is still able to replicate. Preferably, the insertion is 
engineered in firame with tiie rest of the polyprotein coding region, to allow correct translation 
25 of the polyprotein region downstream fiom the insertion. 

Insertions into die polyprotein coding region could introduce a gene encoding a 
hetmlogous protein. The dboice of faeterologpus protein is not narrowly limited and can 
include a pixytein that is tiieR9)eutic to tiie infected host or ceU, 0^ 

and purified for another putpose. Particularly usefid hetearologous genes include those used 
30 for detection of die variant (i.e., reportor genes), or for selection of cells having the variant 
Nonlimitmg examples of rqxnrter genes useful in the present invention include p- 
galactosidase, p-glucunniidas6» fireOy or bacterial hidferase, green fluorescent protem (GFP) 
and humanized derivatives thereof oeUsur&cemaites, and se^ Suchproducts 
are eitiier assayeddirectiy<n- may activate die expression or activity of additional rqxntears. 
35 Nonlimiting examples of selectable marioers for mammalian cells include, but are not limited 
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to, the genes encoding dibydtofolate reductase (DHFR; mettiotracate resistance), fbymidine 
kinase (tk; methotrexate resistance), poroniycin acetyl transferase (par, puron^cin 
resistance), neomycin resistance (neo; resistance to neon^cin or G418), mycophenolic acid 
resistance (gp/),hygroniycin resistance, blasticidiniesistaiice, and r^ Other 

5 selectable marters can be used in different hosts such as yeast (iira3, hisS^ leUl^ /ipl). 

The present inventicm also encompasses HCV variants that have alterations in flie 
noncoding regions of flie virus. For example, the foreign gene discussed above can also be 
inserted into a noncoding region of &e virus, provided the region m& the insert contimies to 
be sufBcientlyfimctional to allow replication. To provide for translation of a foreign gene 

1 0 inserted into a noncoding region, tiie foreign, gene must be operative^ linked to translational 
start signals, preferably an intemd ribosome entry site (DUBS) derived fix>m cellular or viral 
mRNAs [Jang e/ a/., £/tz);ine 44:292-309 (1991); Macgak and Saniow,//atiire 353:90-94 
1991); MoUa et al.. Nature 356: 2SS-2S7 (1992)]. In essence, this strategy creates a second 
cistron in the variant, separate fixnn the polyprotein coding reg^^ A preferred IRES 

IS is the encephalomyocarditis virus (EMCV) IRES. 

The foreign gene can also be insaled into the 3* imi or the 5' NHL Inthe3*NrR, 
the foreign gene/IRES cassette is preferably inserted into the most S*, variable domain. 
However, insertions are also envisioned for other regicHis of the 3* NTR, such as at the 
jimction of the variable region and the polypyrimidine region, or within tiie polypyrimidine 

20 region. In the S*NTR, the foreign geiie is preferably insoled into die area just a^^^ 

the internal HCV IRES. la these variants, the foreign gene is engineeied to be operably 
linked to the HCY IRES. Where this is the case, it is preferred that the second IRES (e.g., an 
EMCV IRES) is oigineeiied just 5' to the po]ypn>tem coding re^ 
thatregion. SeeExan9leandLolin]asnetal.,^i(pra. 

25 Some of flie above strategies ibr functional eaqiression of heterologous genes have 

been previously desoibed. SeeBredenbeekandSice,(1992)^rafQrreview; see, also 
Figure 2, which is also Figure 2 of U^. Platent Application No. 08/81 1,566 (Now U.S. Patent 
No , \ 

Additionally, noncodmg region alterations such as mutations, ddetions or insertions 
30 that do not eacode a foreign protem are within flie scope of the invoition. Forexample, 
mutations, deletions of insertions in fiie variable or polypyrimidine regions of the 3' NTE^ 
including deletions of the entire variable region, or in the 5* KIR. region, that oeate or destroy 
restriction sites or make fiie variant odierwise identifiable can be used advantageously to 
create a "tagged" variant See, e.g.,Bxainple, where a imitation in tiie variable region of ttie 3' 
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NTR created an easily identifiable Avail restriction site, and where a deletion in the 
polypyrimidine region created another identifiable variant 

The polyprotein coding sequence can compise mutants with desirable functional 
adaptations such as adaptive or attenuated variants. These improved variants can be siqierior 
5 in any desired characteristic. Nonlinutnig examples of characteristics that can be i^ 

by the present methods include more rapid or more accurate replication in vivo or in culture, 
improved transfection efficiency, improved ability to establish siibpassaged cell lines, ability 
to infect a host or a host cell line, virulence, and attenuation of disease symptoms. 

Such HCV variants may be adaptive, e.g. , by selection for propagation in animals or 

10 in vitro. See, e.g., Exanq)le. Alternatively* ^variants can be engineefed by design to 
comprise the functional adqitation. See, e.g.,Exanq>le,whe]:e a deletion was designed that 
had increased transfection efficiency and ability to be subpassaged to create a stable cell line, 
siq>porting persistent HCV replication. 

Non-fimctional HCV clones, e.g., that are incq)able of genuine replication, flxat fidl to 

1 5 produce HCV proteins, tibat do not produce HCV KNA as detected by Northern anafysis, or 
that fiul to infect susceptible animals or cell lines in viiro, can be corrected usixig components 
ofthe variants of the present invention. By conq;>aring a variant of an authentic HCV nucleic 
acid sequence of the invention, with the sequence offbe non-functional HCV clone, defects in 
the non-fimctional clone can be identified and corrected, and the corrected, replicating variant 

20 could have characteristics like ^variant, such as an adaptive mutation, etc. All of the 
methods for modifying nucleic acid sequences available to one of skill in the art to effect 
modifications in the non-fimcticHial HCV g^ome, including but not limited to site-directed 
mutagenesis, substitution of ttie functional sequence fixnn an authentic HCV variant for the 
homologous sequence in the non-functional clone, etc. 

25 Adaptation ofHCVfor more improved cell culture cliamctem Replication and 

transfection efficioacy and stability of virions and replicons ^t have wild-type polyprotein 
replication in cell culture is inefficient That is, cells transfected with, e.g., RNA transmpts 
of clcmes of tibese strains rq>licate slowly in culture and the transfected cells are difficult to 
maintain. Additionally, transfection efficiency is poor. Thatis, very few cells that are 

30 transfected with die RNA rq)licon are able to support HCV replication. See, e.g.. Example 1 
and Lohmann et al., ^ra, where less than 0.01% of Buh-7 cells transfected with RNA 
transcripts of replicons ttat have a wild-type (genotype 1 , subtype lb) nonstructural 
polyprotein coding region grew into colonies on thepetri dish where the transfectants were 
'plated. FurtfaermcMre, a low percentage ofcoloniesfliataro%fix>m the original platm^ 
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could be subpassaged onto anofher dish of media to form an isolated stable cell line 
supporting HCV rqplication. 

"Transfection efBciency" is defined by determining the percent of cells having 
rq)licating HCV KNA that continue to translate proteins encoded by die transfected nucleic 

5 acids. The easiest way to measure this is by detennumigfte pen^mtage of 

charactocisticconforedbylheHCVKNA. See> e.g.. Example 1, where replicons comprising 
a neo gwe conferred G4 1 8 resistance to flie transfected cells, and whoe tihie cells were G4 1 8 
resistant after dividing and forming colonies on ttie dish wheie tiie transfected cells wm 
plated. In that e3ranq>le,G418 resistance would not persist sufGdendy for colom 

1 0 mdess fba HCV KNA was able to rq>licate and partition into die dividing cells while 

continuing to rqilicate and translate die neo gene to confer G4 1 8 resistance. Transfection 
efBciency is thus replication dqpenden^ in diat the transfected HCV must replicate, 
transmbe, and translate the measured charactmstic (here, G418 resistance). Indie context of 
the neo selectable maxker» dus method of determining transfection efBciency is termed 

IS '^r^lication-dependent neomycin resistance**. Thisistheprefenedway of measuring 

transfection efBciency because it only measures transcription fiom HCV that established itself 
sufBciently to replicate and partition into dividing cells to form a colony. 

Another disadvantageous cell culture characteristic of HCV nucleic acid that has 
wild-type nonstructural polyprotein genes is that onfy a low percentage of colonies that form 

20 after transfection and selection are able to continue to be maintained upon subpassage as 
continuous cell lines harboring replicating KNA. Thiswas<3%inLolimannetal.,as 
discussed 5ru/7ra. 

Disadvantageous characteristics of HCV having wild-type nonstructural polyprotein 
genes can be reduced by utOizmg certain adaptive mutations and deletions in die NSS A 

25 coding region or elsewhere as disclosedlierein. Preferred mutations comprise altraations in 
the encoded amino acid sequence in aregim of IheNSSAlhat is just S* to the coding region 
of the "interferon sensitrvity-determining region" (ISDK). Specifically, various mutations 
witUn about SO nucleotides 5* to the ISDR, more preferably within about 20 nucleotides of 
the ISDR, where die encoded amino add sequence is altaed, have the effect of adqiting an 

3 0 HCV to have higher transfection ^ciency and increased ability to withstand subpassage to 
establidi a ceU line harixxriiig persistent HCV replicatioiL Specific mutations having tins 
effect include Ser to lie at amino acid 1 179 of SEQ ID NO:3 (subtype lb nonstructural 
polyprotem r^on), conferred, for example, by the mutation g to t at position 5336 of SEQ ID 
NO:6, embodied in SEQ ID NO:8 (nucleotide[nt]) and SEQ ID NO:16 (amino acid[aa]); Arg 

3S to Gly at amino acid 1164 of SEQ IDN03, confored, for example, by the mutation from a to 
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g at position S289 of SEQ ID N0:6, embodied in SEQ ID N0:9 (nt) and SEQ ID N0:17 (aa); 
Ala to Ser at amino acid 11 74 of SEQ ID N0:3, conferred, for example, by the mutation from 
g to t at position S320 of SEQ ID NC>:6, CTdxkUed in SEQ ID NO: 10 (nt) and tibe NSSA 
amino acid sequence of SEQ ID NO: 19; Ser to Cys at amino acid 1 172 of SEQ ID N0:3, 

S conferred, for example, by the mutation c to g at position S3 IS of SEQ ID NO:6, embodied in 
the NSSA gene SEQ ID NO: 1 1 and tiiie NSSA amino acid sequmce of SEQ ID NO:20; and 
Set to Fro at amino add 1 1 72 of SEQ ID NO: 3 , conferred, for example by the mutation t to c 
at position S3 14 of SEQ ID NO:6, raibodied in tihe NSSA goie SEQ ID NO: 12 and Hie NSSA 
amino acid SEQ ID N0:21. The adq>tiveefiG»tofthese mutations is surprising since flus 

10 region ofHCS^ is nonnally conserved among HCV isolates. Additionally, deletions within 
flie ISDR, including deleticms of fte entire ISDR and various flanking sequences, cause this 

adaptive effect Atmrng Aese dcletims w ihe snhgritiirimi ftf the IS^H and fljmlffng RegiiPTifigt 

comprising amino acids 1182to 1229of SEQ ID NO:3 with a tyrosine, conferred, for 
example, by the deletion of nt 5345-5485 of SEQ ID N0:6, and embodied in SEQ ID N0:7 

IS (nt) and &e NSSA amino acid SEQ ID Nb:14. 

HCV" variants comprising mutations adaptive to ceU culture may also be attenuate^ 
that is impaired in its ability to cause disease establish cfanmic infections, trigger autoimmune 
responses, and transform cells. 

Tbe present invention also discloses methods for selecting for adaptive HCV variants. 

20 These methods comprise the use of an HCV virion or preferably a replicon, which further 
conqnises a doiiiinant selectable marioa' such as a neogoie. Cells are transfeeted with these 
variants. Hie transfectants are plated into selection media, such as G41 8 when the /leo gene is 
utilized in the variant Colonies timt arise to exhibit resistance to the selectable inaiker are 
subpassaged into fresh selection media. HCV in colonies tiiat withstand subpassage to 

2S establish a cell line harbiorii^ HCV replication can be isolated and used to transfect additicmal 
cells. Any of these colonies Haat show increased transfection efiBdenqr or otiier desirable 
chanusteristics, such as die ability to withstand subpassage, aie adap^^ 
adaptive nature oftiie variant is conferred by at least one mutation or deleti^ Seleciedareas 
ofthe HCV in these adqitive variants are sequenced. Ptefiarably, at least the NSSA is 

30 sequenced. More pref«d>ly, the entire polyprotein coding region is sequer^ Ai^ 

mutations in tiiese variants can be further evaluated to detomine the adaptive nature of the 
mutations. That evaluation prefetably involves recreatnig die mutation in an o& 
type coding region and determining if die recreated HCV mutant exhibits the adaptive 
phenotype of the original mutant 
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Adaptive mutations could also be manifested, 1^ (i)alt^mgfhe 
trqpism of HCV KNA replicatioi^ (ii) altering viral products responsible for deleterious 
effects on bost cells; (iii) increasing or decreasing HCV KNA replication efiBciency; (iv) 
increasing or decreasing HCV RNA packaging e£5ciency and/or assembly and release of 

5 HCV particles; (v) altering cell tropism at tiie level of receptor binding and entry. Thus, the 
engineered dominant selectable marker, whose expression is depeadeaat upon productive HCV 
KNA replication, can be used to select for adaptive mutations in either the HCV replication 
machin»yortfaetransfectedhostoell,orbotiL la addition, dominant selectable markieis can 
' be used to select for mutations in tiie HCV replication machinery that allow hig^ levels of 

10 KNA. replication or particle fonnation. In one exanq)le,engmeered HCV derivatives 

expressing a mutant finrn of DHFR can be used to confer resistance to methotrexate (MIX). 
As a dominant selectable marker, mutant DHFR is inefBcient since nearly stoichiometnc 
amounts are required for MIX resistance. By successively increasing concentrations of MIX 
in the mediimi, increased quantities of DHFR will be required for continued survival of cells 

IS harboring the rq^licating HCV RNA. Ihis selection scheme, or similar ones based on this 
concept, can result in the selection of mutations in the HCV KNA rq>lication machinoy 
allowing higher levels of HCV RNA replication and RNA accumulation. Similar selections 
can be applied for mutations allowing production of higher yields of HCV particles in cell 
culture or for mutant HCV particles with altered cell tropism. Such selection schemes involve 

20 harvesting HCV particles from culture supematants or after cell disruption and selecting for 
MIX->resistant transducing particles by reinfection of naive cells. 

Methods similar to the above can be used to establish adaptive variants with 
variations in charactmstics such as ibc increased or decreased abibty to cause infection, the 
ability to cause infection in a host that,wUd-type strains are unable to infect; or cells of such a 

25 host • 

The invention also provides host cell lines transfected with any of dze HCV DNA (or 
HCV KNA) as set fortti above. Examples ofhostceUs include, but are by no ineanslirnited 
to, the group consisting of a bacterial cell, a yeast cell, an insect cell, and a mammalian cell. 
Preferably, the host cell is cqsable of providing for expression of functional HCV SNA 

30 replicase, virions or vmis particle proteins. 

In arelated aq>ect, as briefly described above, tiie invention provides a vector for 
gene therapy or a gene vaccine (also tenned herein a genetic vaccine), in which a 
hetOT)logous protein is inserted into the HCV nucleic acid under conditions that permit 
expression of the heteroli^us protemu These vaccines can be eilfaer DNA or KNA. In 

35 particular, tiie invention provides an infectious hqiatitis C virus (HCV) DNA vector 
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conaprising fiom 5' to 3 ' cm the positive-soise DNA, a promoter; an HCV S'-non-translatBd 
region (NTR) containing the extreme S'-tmninal sequence CKXAGCC; an HCV polyprotein 
coding region comprising a coding region for a heterologous gene; and a 3' non-translated 
region (NTR). Prefisrably, the promoter is selected from ttie group consisting of 

5 bacteriophage T3, T7, and SP6. 

la the embodiments of the invention wheane the functional HCV nucleic acid is DNA, 
it nmyfurdier comprise a promoter opexatively associated with the S' NTR. For example^ but 
not by way of limitation, thepromoter may be selected fiom flie group consisting of 
bacteriophage T7, T3, and SP6. However, any suitable promoter for transcription of HCV 

1 0 genomic SNA co]i:e£q;x>nding to the HCV DNA can be used, dqpoading on Hbs specific 
transcription system employed. For exanq>le^ finr nuclear transcription C&g'., in an animal 
transgoiic for HCV), an endogenous or viral promoter, such as CMV, may be used. 
Additionally, these promoter-drivm HCV DNAs can be incoipoialed into an 
extrachromosomally replicating DNA such as a plasmid or a phage. 

IS Varioiis uses ofthe invention variants are envisioned herein. Uses relevant to thoapy 

and vaccine developmmt include: (i)fiiegaieration of defined HCV virus stocks to develop 
in vitro and in vivo assays fcnr virus neutralization, attachment penetration and entry^ (ii) 
structure/function studies on HCV proteins and SNA eloaoeDts and identification of new 
antiviral target^ (iii) a systematic survey of cell culture systems and conditions to identify 

20 those that support wild-type and variant HCV RNA replication and particle release; (iv) 

production of adaptive HCV variants capable of m<nne efBcimt replication in cell culture; (v) 
production of HCV variants with altered tissue or species tropism; (vi) establisfamoat of 
alternative animal models for inhibitor evaiuation including those supportmg HCV variant 
replication; (vii) developmoit of ceU-fiw HCV leplicaticm assays; (viii) production of 

25 immunogenic HCV particles for vaccination; (ix) mgineering of attenuated HCV derivatives 
as possible vaccine candidate^ (x) oigineering of attenuated or defective HCV derivatives for 
expression of heterologous gene products for gene Iher^ and vaccine applications; (xi) 
utilization ofthe HCV glycoproteins for targeted delivery of therapeutic agents to the liver or 
other cell types with appropriate receptors. 

3 0 The invoition fiirSier provides a me&od for infecting an animal witii HCV variants, 

wha:e tiie method comprises adndnistering an infectious dose of HCV variant RNA prepared 
by transcription of infectious HCV variant DNA The invention extends to a non-human 
animd infected witii HCV variants or Iransfectedwitii HCV variant RNA or D Similarly, 
the invmtion provides a meOiod for propagating infectious HCV variants in vitro comprising 

35 . culturing a ceU line contacted wilh an infectious amount of HCV variant RNA prq^ 
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transcripticm of the infectious HCV DNA, as well as an fa vitro cell line infected willi HCV 
variants. In a specific embodinient, the cell line is a hepatc^^ 
infected wifh an HCV variant in wUdb an IRES-antibiotic resistance cassette has been 
engmeeced to provide for selection. The variant may also comprise the adaptive mutations 

S described above. 

In accordance wifh the gene therapy (genetic vaccine) embodiment of the invention^ 
also provided is a method finr transducing an animal capable of HCV RNA rq>lication wifh a 
heterologous gene» conqiristng adnunistering an amount of an HCV variant SNA prepared fay 
transcription of tiie HCV variant DNA vector. 

10 In another onbodiment; the invention provides a method for producing HCV particle 

proteins conqnising culturing a host oqnession cell line transfected with an HCV variant of 
the invention under conditions tiiat permit expression of HCV particle proteins; and isolating 
HCV particle proteins fiom the cell culture. In a specific embodiment, such an expression 
cell line may be a cell selected £rom Ae ffovp consisting of abacterial cell, ayeast cell, an 

15 insect cell, and a mammalian cell. 

The invention fiirflier provides an HCV virion compri sin g an HCV variant RNA 
genome. Such virions can be used in an HCV vaccine, preferably after attenuation, e.g., by 
heat or chemical treatment, or through selection of attenuated variants by the methods 
described above. 

20 The in vivo and in vitro HCV variants of the invration permits controlled screening 

for anti-HCV agents (Ic, drugs for treatment of HCV), as well as for evaluation of drug 
resistance. An in vivo metix>d for screening for agents capable of modulating HCV 
replication may conqnise administering a candidate agent to an animal containing an HCV 
variant, and testing for an increase or decrease in a level of HCV variant infection, replication 

25 or activity compared to a level of HCV variant infection, iq)lication oractivity in the animal 
prior to administration of die candidate agen^ wfaerem a decrease in tiie level of HCV variant 
infection, rq[>licatioii or activity compared to the level of HCV variant infection, implication or 
activity in tiie animal prior to administratis of the candidate agent is indicative of tiie ability 
oftheag^ to inhibit HCV variant infection, rqplicatian or activity. Testing for tiie level of 

30 HCV variant infection or replication can involve measuring the viral titer (eg., KNA levels) 
in a serum or tissue sample fiom the animal; testing for the level of HCV variant activity can 
involve measuring liver enr^mes. Altenmtivety, an in vilromefhod for screening for agents 
capable of modulating HCV replication can comprise contacting a cell line sq^K^rtrng a 
replicating HCV variant with a candidate ag^t; and thereafter testing for an increase.or 

35 . decrease in a level of HCV variant rq>Hcation or activity coinpared to a level of HCV variant 
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leplicaticm or activity in a control cell line or in the cell line prior to adnunistration of flie 
candidate agent, wherein a decrease in flie level of HCV variant replication or activity 
conqpared to the level of HCV variant rq)lication or activity in a control cell line or in flie cell 
line prior to administration of flie candidate agent is indicative of the ability of the agent to 
S inhibit HCV variant rq)lication or activity. Jn a specific embodiment, testing for tibe level of 
HCV variant replication in vitro may involve measuring the HCV titer, (e.g,, KNA levels) m 
ttie cell culture; testing for the level of HCV activity in vitro may involve measuring HCV 
replication. 

Jn addition to ttie specific HCV variant DNA clones and related HCV variant RNAs, 
10 the mvention is directed to a method finr preparing an HCV variant DNA clone that is capable 

of replication in a host or host cell line, comprising johung fiom S' to 3' on fte positive-sense 

DNA a promoter; an HCV 5' non-trandated region (NIK) an HCV polyprotein coding region; 

and a 3 * non-translated region ^fTR), whm at least one of these regions is not a naturally 

occurring region. Piv&iably,ftepromotBr is selected fitmifibegroiqi consisting of 
15 bacteriophage T7,T3, and SP6. hi a specific embodiment, tiie extreme S'-tenninal sequence 

is homologous to SEQ ID NO: 1, &g., the S'-tenninal sequence may be selected fiom &e 

groiq) consisting of GOCAGCQ GGCCAGCC; UGOCAGCC; AGOCAGCC; 

AAGOCAGCC; GAGOCAGCC; GUGCCAGCC; and GCGCCAGCC, whorein the sequence 

GOCAGOC is fte 5'-tamimis of SEQ ID NO:l. 
20 The 3'-Nm polypi for use in the method ofpreparing an HCV variant DN^ 

may include a long poly-U region. Similarly; the 3'-NTR extreme terminus may be RNA 

homologous to a DNA having flie sequence 

5'-TGGltKKnxXATCITAG0CC^^ 

GCATGACIXX^AGAGAGTGCTXJATACTGGCCT (SEQ ID 

25 NO:2); in a specific embodiment, tibe 3'-NrR extreme tennmus has tibie fcnegoing sequence. 

Components €ffimctional HCV variant DNA clones. Components of the fimctional 
HCV variant DNA described in this invention can be used to devel(q>ceU-fiee,ceUcult^ 
and aninoal-based screoung assays for known or newly identified HCV antiviral targets as 
described tn/hi. F<nr each sdected target; it is preferred tiiat the HCV variant used has the 
30 wild-type form of the targ^ Examples of known or suq^ted targets and assays include [see 
Houghton, In "Fields Virology" (B. N. Fields, D. M. Rnipe and P. M. Howl^, Eds.), Vol. 
pp, 1035-1058. Raven Press, New York (1996); Rice, (1996) stq}ra; Rice et al., Antiviral 
Therapy 1, SuppL 4, 11-17 (1997); SYan^otobao^Hepatology 21,:887-8 (1995) for reviews], 
but are not limited to, the following: 
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The highly conserved S' NTR, ^ch ccmtaiiis elements essential for translatian of tbe 
inconiingHCVgqaoine SNA, is one target It is also likely fhatfhis sequence, or its 
, coinplenicait, contains RNAl elements ixnportantfo^ . 
Potential thersqpeutic strategies include: antisense oligonucleotides (stpra); trans-acting 

5 ribozymes (nipra); KNA decoys small molecule compounds interfering with ttie function of 
tins element (ttiese could act by binding to &e RNA element itself or to cognate viral or 
cellular factors required for activity). 

Another target is tibe HC^ C (cq)sid or core) protein, which is In 
is associated with fte following fimctions: KNA binding and specific encapsidation of HCV 

10 genome IQ^lA; transcriptional niodulation of 37:209-220 
(1995)] and other inral [SUh er o/L, J. ViroL 69: 1160-1171 (1995); Shih ei al., J. ViroL 67: 
5823-5832 (1993)] genes; binding of ceOuhrheUcase [You etal., J. Virol 75:2841-2853 
(1999)]; ceUular transformatian [Ray ei aL, J. ViroL 70: 4438-4443 (1996a); Ray et al., J. 
Biol Chem. 272:10983-10986(1997)]; prevcaition of apoptosis [Ray et al, Virol 226: 

15 176-182 (1996b)]; modulatioa of host immune responscthrough binding to members of flie 
TNF receptor superfemily [Matsumoto et al, J. Virol 71: 1301-1309 (1997)]. 

The El, E2, andpeihapis the E2-p7 glycoproteins that foim the components of the 
virion envelope are targets for potentially neutralizing antibodies. steps where 
intervention can be targeted include: signal peptidase mediated cleavage of these precursors 

20 from Ae polyprotein [Lin et al., (1994a ) snqjra]; ER assembly of the E1E2 glycoprotein 
complex and association of these proteins with cellular chaperones and folding machinery 
[Dubuisson et al, (1994) siqyra; Dubuisson and Rice, 7. Virol 70: 778-786 (1996)]; 
assembly of virus particles including interactions between the nucleocapsid and virion 
envelope; transport and release of virus particles; &e association of virus particles witii host 

25 components such as VLDL [Elijikata et a/./(1993) supra; Thomssen et al., (1992) supra; 
Tbomssen et al, Med. Microbiol Immunol 182: 329-334 (1993)] which may play a role in 
evasion of immune aurveiDance or in bmding and entry of cells expressing the LDL receptor; 
conserved and variable detmninants in tiie vhion which are targets fx neutralization by 
antibodies or whichbind to antibodies and fiunlitate immune-enhanced infection of cells via 

30 interaction witii cognate Fcrecqytors; conserved and variable determinants in 1h^ 

important fen: receptor binding and entry; virion detenninants participating in entry, fusion 
with cellular membranes, and uncoatingtiie incoming viral nucleoci^d. 

The NS2-3 autoprotBase, ^ch is required for cleavage at the 2/3 site is a furflier 

target 
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The NS3 serine protease and NS4 A co&ctor which fonn a complex and mediate four 
cleavages in the HCVpolyprotein [see Rice, (1997) ^/^ni for review) is yet anolher suitable 
target. Targets include the serine protease activity itself the tetrahedral Zn^ coordination site 
in tibie C-terminal domain of the serine protease; flie NS3-NS4A co&ctor interaction; &e 
5 membrane association of NS4^ stabilization of NS3 by NS4A; transforming potential of the 
NS3 protease region [Sakamuro eioL, J Virol 69: 3893-^ (1995)]. 

The NS3 RMA-stimulated NlPase [Suzich ei al,, (1993) sng^ra], SNA helicase [Jin 
and Peterson, Arch Biochem Biophys 323: 47-53 (1995); Kim et al., Biochenu Bicphys. Res. 
Conumm, 215: 160-6 (1995)], and SNA binding [Kanaie^a2.«F£S5i^ 376:221-4(1995)] 
10 activities; AeNS4A protein as a component of flie SNA replication 001^ 
potential target. 

The NS5A protein, another repUcationconqxment, represents This 
protein is phosphoiylated predominandy on serine residues [Tanji et al., J. Virol 69: 
3980-3986 (15)95)]. Transcription modulating, cell growth promoting, and apoptosis 

15 inhibiting activities of NS5A [Ghosh et al., 7. Biol Chenu 275:7184-7188 (2000)] can be 
targeted. Other characteristics ofNS5A that could be targets for tiierapy include the kii^^ 
responsible for NS5 A phosphoiylation and its interaction widi NS5 A; and the intmction with 
NS5A and otfaor components of the HCV rq>lication conq>lex. 

The NS5B SNA-dependent SNA polymerase, which is the enzyme responsible for 

20 the actual synthesis ofHCV positive and negative-strand SNAs, is another target Specific 
aspects ofits activity include fte polymerase activity itself [Befareuse^ a/., £Af^ 15: 
12-22 (1996)]; interactions of NS5B with otiier replicas components, including the HCV 
SNAs; steps involved in the initiation of negative- and positive-strand SNA synthesis; 
phosphorylation of NS5B O^wang et aL, Virology 227:438 (1997)]. 

25 Other targets include structural or nonstructural protein functions important for HCV 

SNA rq;)Ucation and/or modulation of host ceUfimction. Possible hydrophobic inotein 
conqxments cqnble of tcfosang diannels inq)artant for viral entry, egress or modulation of 
host cell g^e esqnession may be targeted. 

Thie 3' NTS, especially tiie highly conserved elements ^ly (UAJQ tract; 98-base 

30 terminal sequence) can be targeted. Therapeutic iqiproachesparanel those described for tiie 5' 
NTS, excqpt that tins portion of tiie gmome is likely to play a role in the initiation of 
npgative-strand synfliesis. ttiniv also be involved in otiier aspects ofHCV SNA replication, 
including translation, SNA stability, <n: padcaging. 

Tbe functional HCV variants of tiie present invention may encode all of the viral 

35 proteiiis and SNA elements required for SNA packagiiig. These elements can be targeted for 
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developmeot of antiviral compounds. Electropluxretic mobility shift, UV cross-linking, filter 
bindi]% and Hiree-faybrid [SmOupta ei aL, Proa NatL Acad. ScL USA 93: 8496-8501 
(1 996)] assays can be used to define the protein and RNA elements important for HCV RNA 
packaging and to establish assays to screen for inhibitors of this p^ Such inhibitors 
5 mig^t include small molecules or RNA decoys produced by selection in vitro [Gold et aL, 
{\99S)suprd\. 

Complex Ubraries of the variants of &e piesoit invention can be prepared using PGR 
shufQing, or by inc(«porating randomized sequences, such as are generated in ""pqptide 
display^ libraries. Usmg the "phage method** [Scott and Smith, 1990, Sdence 24P:386-390 

10 (1990); Cwirla, et aL, Proc NaiL Acad. Sd USA,. «7:6378-6382 (1990); Devlin et aL, 
Science, 24^:404-406 (1990)], voy large libraries can be constructed (10^-10^ chemical 
entities). Qonesfiom such libraries can be used to ^nerate other variants or ^ 
using various HCV subtypes. Such variants can be generated by methods known in &e art, 
without undue experimentation. 

15 A clone ttiat includes a -pxisner and run-off sequoice can be used directly for 

production of functional HCV variant RNA. A large numbo: of vector-host systems known in 
the art may be used. Examples of vectors include, but are not limited to, E. coli, 
bacteriophages such as lambda doivatives, or plasmids such as pBR322 derivatives or pUC 
plasmid derivatives, ag:, pGEX vectors, pmal-c, pFLAG, pTET, etc. As is well known, the 

20 insertion into a cloning vector can, for ^cample, be acconq>lished by ligating the DNA 

fragment into a cloning viector that has complementary cohesive teswim. Howevo*, if the 
complemoitary restriction sites used to fiiagmoit the DNA are not present in the cloning 
vector, the ends of die DNA molecules may be oizymatically modified. Alternatively, any 
site desired could be produced by ligating nucleotide sequences (Unkers) onto the DNA 

25 tmnini; diese ligated linkers may conqnise specific chemically synthesizedfoligonucleotides 
encoding restriction endomiclease recognition sequences. Recombinant molecules can be 
introduced into host cells via transfisnnatioii, transfisction, mfection, electroporation, etc., so 
that many copies of the gene sequence are generated 

30 Expression of HCV KNA and Polypeptides 

Hie HCV variant DNA, which codes far HCV variant RNA and HCV protems, 
particularly HCV RNA repUcase or virion proteins, can be inserted into an qipropriate 
expressicm vector, a. e. , a vector wiudi contains fte necessary elements for &e transa:q>tion 
and translation of the inserted protem-ooding sequence. Such elements are termed herein a 

35 "promoter." Thus, the HCV variant DNA of the invention is operationally (or operably) 
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associated with a promoter in an expression vector of the invention. An e^qiressian vector 
also preferably includes a leplicatioii origin. The necessary transcriptional and translational 
signals can be provided on a reconibiiiant expression vector. In a preferred embodimotit for in 
viY^ synthesis of functional RNAs, the T7, T3, or SP6 promoto* is used. 

5 Potential host-vector systmis include but are not limited to mammalian cell systems 

infected with virus recombinant (eg., vaccinia virus, adenovirus, Sindbis virus, Semliki 
Forest virus, etc.); insect cell systems infected with recombinant viruses (e.g:, baculovirus); 
microorgaiiisms such as yeast containing yeast vectors; plant cells; or bacteria transformed 
with bacteriophage, DNA^plasmidDNA, or cosmidDNA. The eTqiression elements of 

10 vectors vary in their strengths and qiecificities. Depending on the host-vector system utilized, 
any one of a mmiber of suitable transoriptiQn and translation elements may be used. 

The cdD into which the recombinant vector conqirising the HCV variant DNA clone 
has been introduced is cultured in an appropr iate cell culture medium under conditions that 
provide f<H:expresdonofHCV SNA or such HCV proteins by the oel^ Any of flie methods 

1 S previously descxibed finr Ae insertion of DNA fragments into a cloning vector may be used to 
construct expression vectors containing a gene consisting of appropriate 
transoiptiorial/faraiidational control signals and the These metibods 

may include in vitro recombinant DNA and synthetic techniques and in vivo recombination 
(genetic recombination). 

20 Expression of HCV variant RNA or protein may be controlled by any 

promoter/enhancer element known in the art, but these regulatory elements must be functional 
in the host selected for expression. Promoters which may be used to control esqnression 
include, but are not limited to, the SV40 early promoter region (Benoist and Chambon, 198 1 , 
Nature 290:304-310), the promoter contained in the 3' long terminal repeat of Rous sarcoma 

25 virus (Yamamoto, et al„ 1980, Cell 22:787-797), the herpes ^ymidine kinase promoter 

(Wagner et oL, 1981, Proc. Natl. Acad. Sci. U.S A. 78:1441-1445), ttie regulatory sequences 
of the metallotfaionein gene (Brinster et aL, 1982, Nature 296:39-42); prokaryotic e^qiression 
vectors such as the P-lactamase promoter (Villa-Kamarof^ et al, 1978, Proc. Natl. Acad. ScL 
U.S A. 75:3727-3731), or the lac jwomoter (DeBoer, et al, 1983, Proc. Natl. Acad. Sci. 

30 U.S A. 80:21-25); promoter elemorts from yeast or other fimgi such as the Gal 4 promoter, 
the ADC (alcohol dd^drog^nase) promoter, PGrK ^hosphoglycerol kinase) promote, 
alkaline phosphatase promoter; and the animal transcriptional control regions, which exhibit 
' tissue specificity and have been utilized in traxisgenicaxiiinals: elastic 
^ch is active in pancreatic acinar cells (Swift et a/., 1984, Cell 38:639-646; Qmitz et aL, 

35 • 1986, Cold Spring Harbcn: Synq). Quant Biol. 50:399-409; MacDonald, 1987, Hq>atology 
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7:425-5 1 5); insulin gene control region which is active in pancreatic beta cells (Hanahan, 
1985, Nature 315:115-122), immunoglobulin gene controlregion which is active in lynq>hoid 
cells (Grosschedle/oiL, 1984, Cell 38:647-658; Adames 6^ oil, 1985, Nature 318:533-538; 
Alexander et al., 1987, MoL Cell. Biol. 7:1436-1444), mouse mammaiy tumor virus control 

5 region wUch is active in testicular, breast, b^lu>id and mast cells 1986, Cell 

45:485-495), albumin gene control region which is active in liver OPinkert et aL, 1987, Genes 
and Devel. 1 :268-276)» alpha-fetoprotein gene control region which is active in liver 
(ECrumlaufe/a/., 1985, MoL Cdl.BioL 5:1639-1648; Hammer e/ a/., 1987, Science 235:53- 
58), alpha l-antitrypsin gm control region which is active in the liver (Kelsey ei oL, 1987, 

10 Genes and Devel. 1:161-171), beta^globin gme control region which is active in myeloid cells 
^ogcam et al„ 1985, Nature 315:338-340; Kollias et oL, 1986, Cell 46:89-94), xoyelin basic 
protein g^e control region which is active in oligodendrocyte cells in tiie brain (Readhead et 
al., 1987, Cell 48:703-712), myosin lig^ diain-2 ^e control region which is active in 
skeletal muscle (Sani, 1985, Nature 3 14:283-286), and gimadotropic releasing hormone gene 

15 control region which is active in the hypothalamus (Mason et al., 1986, Sdeoce 234: 1372- 
1378). 

A wide variety of host/expression vector combinations may be employed in 
expressing the DNA sequences of flus invention. Useful expression vectors, for exapiple, 
may consist of segments of chromosomal, non-chromosomal and synflietic DNA sequences. 

20 Suitable vectors include draivatives of SV40 and known bacterial plasmids, e.g., E. coli 

plasmids col El, pCRl, pBR322, pMal-C2, pET, i>GEX [Smith et aL, 1988, G«ie 67:31-40], 
pMB9 and their derivatives, plasmids such as RP4; phage DNAS, e.g. , the numerous 
derivatives of phage X, ag., NM989, and o&er phage DNA, e.g., M13 and filamentous single 
stranded phage DNA; yeast plasmids such as the 2^ plasmid or derivatives thereof vectors 

25 useful in eukaryotic cells, such as vectors useful in insect or mammalian cells ; vectors derived 
Scorn combinations of plasmids and phage DNAs, such as plasmids that have been modified to 
employ phage DNA or oth^ expression control sequences; and tiie like known in tiiie art 
In addition to the preferred sequencing analysis, expression vectors containing an 
HCV variant DNA clone of die invention can be idmtified by four general approaches : (a) 

30 PCR ampUfication of flie deared plasmid DNA or specific miRNA, (b) nucleic add 

hyfaridization, (c) presence or absoice of selection maiker gene functions, (d) analysis with 
appro[xriate restriction endonucleases and (e)expressi(m of inserted sequffi In tiie first 
approach, the nucleic adds can be amplified by PCR to provide for detection of fte anqplified 
product In the second approadi,tiie presence ofnucldc acids in an expression vector can be 

3 5 detected by nucleic acid hybridization using probes conqirising sequences tiiat are 
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homologous to the HCV variant DNA. Iafhetibkdqyproach,fliei:ecombixm 
systmi can be identified and selected based iqxm the presence or absence of certain "selection 
marker" gene functions (e.g., p-galacfcosidase activity, ftymidme kmase activity, resistance to 
antibiotics, transformation phenotype, occlusion body formation in ba^ 
5 by the insertion of foreign genes in the vector. In the fourth approach, recombinant 

expression vectors are identified by digestion with appiopri ate restriction en^mes, Intfae 
fifth qiproach, recombinant expression vectors can be identified by assaying for the activity, 
biochemical, or immunological characteristics of the gene product expressed by the 
recombinant, eg., HCVRNA, HCV virions, or HCV viral protdns. 

10 For example, in a baculovinis expressicm ^sterns, both non-fiision'tFBnsfer vectors, 

such as but not limited to pVL941 {BamHl cloning sit^ Summers), pVL1393 (BoheHI, Anal, 
Xbal, EcoRl, Noil, JUndm, jBg/D, and Afl clmmg sit^ Imdtrogen), pVL1392 (JBglO, PsO, 
No A, ^Xmain, EcaRU Xbdi, SfndU and BaniHl cloning site; Summers and Invitrogen), and 
pBlueffodDDE (BomHI, BgfJI, Pstl^ Nco\ and Hindni cloning site, wifli blueAwfaite recombinant 

1 S screening possible; Invitrogoo), and fusion transfer vectors, such as but not limited to pAc700 
{BamHI and ^nl cloning site, in which the BaniHl recognition site b^ins with the initiation 
codon; Summers), pAc701 and pAc702 (same as pAc700, with different reading frames), . 
pAc360 (^£xmHI cloning site 36 base pairs downstream of a polyfaedrin initiation codon; 
Invitrogen(l 95)), and pBlueBacESsA, B, C (three different reading frames, with BamBl^ 

20 BglTL, PsfL, NcdU and HindUl cloning site, an N-tenninal peptide for FroBond purification, 
and blueAvhite recombinant soeenii^ of plaques; hivitrogen) can be used. 

Examples of maimnalian expression vectors contenq)lated for iise in tiie invention 
include vectors with inducible promotm, such as the dihydrofolate reductase (DHER) 
promoter, eg., any expression vectOT with a DHFR expression vectOT, or a 

25 DilFi^/methotrexate co-'anq)lificatiQn vector, such as pED {Pstl^ Sail, Sbdl, Smdly and EcoKL 
cloning site, with the vector expressing both tiie cloned gene and DHFR); [see Kaufinan, 
Ckarent Protocols in Molecular Biology, 16.12 (1991)]. Alternatively, a gjutamine 
synthetase/methionine sulfoximine co-mqplification vector, such as pEE14 (HindSH, XbdU 
SnuHy Sbdl, £coRI, and Bdl clouing site, in which the vector expresses glutamine synthase 

30 and tiie cloned gene; CelUech). b another embodiment, a vector that direc 

expression under control of %stdn Ban: Virus (EB V) can be used, such as piREP4 (BomHI, 
1^ JfAoI, NoO^ NhA^ OndSa^ Nh^ PviM, and Epnl cloning site, constitutive RSV-LTR 
~J promotor, h^ronQrcin selectable maikear, Invitrogen), pCEP4 (BamHl, Sfil^ Xhol, NofL^ Nhe^ 

HbtdSU NhA, PvuSL, and ijwil cloning site, constitutive hCMV immediate early gene, 

35 faygromycin selectable maikBi; Invitrogen), pMEP4 (A^thI, PvuL, Nkel^ Hindm^ NoO, XhdU 
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1^ BamHi cloning sh^ inducible metfaaUottuanein Ila gme promoter, hygromycin 
selectable maiker: lavitrogen), pREPS (JSamHU Xhol^ Not\ HmdHl^ NheL^ and I^nl cloning 
site, RS V-LTR promote, histidinol selectable marker; Invxtrogen), pREP9 (i^nl, Nhel^ 
Hindni, Noil, Xhol, 1^ and BamHI cloning site, RS V-LTR promoter, G41 8 selectable 

5 marker, Invitiogen), and pEBVIfis (RSV-LTR promoter, hygromycin selectable maiker, N- 
terminal peptide purifiable via ProBond lesin and cleaved by enterokinase; Ihvitrogen) . 
Regulatable ynftTpmaHflTi expression vectocs, can be used, such as Tet and iTet [Gossen and 
Bujaxd, Proa NaO. Acad. ScL USA 89:5547-51 (1992); Gossen ei oL, Saence 268:1766-1769 
(1995)]. Selectable mammalian expression vectors for use inlfae invention include pRc/CMV 

10 (HindOU BsiXl^ NoiU SbdU and Apal cloning site, G418 selection; Livitrog^), pRc/RSV 
{Hindnij SpA^ BstXl^ NoiL, Xbdi cloning site, 641 8 selectioI^ iavitrogen), and others. 
Vaccinia virus mammalian esqnession vectors [see, Eaufinan (1991) stq>rd\ for use according 
to the invention include but are not limited to pSCl 1 (SmdL cloning site, TK- and ^-gal 
selection), pMJ601 (SdtU SnuA, Afli^ Nari, BspMR, BaniHU Apal, Nhel, SacO, KpnX and 

15 HindOL cloning site; TK- and p-g^l selecticm), and pTKgptFlS [EcdBl, Ps(k SalU Accl^ 
HmdSL, SbdL, BamHi, and Hpa cloning site, IKor XPRT selection). 

Examples of yeast exinession systems include the non-fiision pYES2 vector (Abal, 
Sphl, ShoU Notl^ GstXI, EcdKL, BstXi, BamHI, SacU Kpn\ and Hindm cloning si^ 
Invitrogen) or the fusion pYESHisA, B, C (AZwJ, SpKL, ShoL, Noil, BstXl, EcoKl, BarriSL, 

20 Sacly Kpnl, and HindUL cloning site, N-tominal p^tide purified with ProBond resin and 

cleaved with enterokinase; Invitrogen), to mention just two, can be employed according to the 
invmtion. 

In addition, a host cell strain may be chosen that modulates the expression of the 
inserted sequences, or modifies and processes the gene product in the specific &shion desired. 

25 Diffeicait host cells have characteristic and specific mechanisms for the tmnslational and post- 
translational processing and modification (£Lg., glycosylation, cleavage [eg., of signal . 
sequence]) of jnroteins. Expression in yeast can produce a glycosylated product Eiqsression 
in eukaiyotic cells can inraease flie likelihood of "native" glycosylation and folding of an 
HCV protein. Moreover, eiqiression in mammalian cells can provide a tool f<X'ieccns^ 

30 or constituting, native HCV virions or virus particle proteins. 

A variety of transfecti<ni meAods, usefiil finr otiier KNA virus studies, can be utilized 
herein wifliout undue e xp erime M tatiop. Examples include microinjection, cell fusion, 
calcium-phosphate catimuc liposomes such as lipofectin [Rice et id.. New Biol 1:285-296 
(1989); see T[CV-based Gene Bxpiesdon Vectors'*, ir^d[, DE-dextran [Rice ei aL, J. Virol 

35 61: 3809-3819 (1987)], and elecHqpcnration [Biedenbeek ei al.. J. Virol 67: 6439-6446 
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(1993);Liljestrdmc/a/., /. VirvL 65:4107-4113(1991)]. Scrape loadmglKain^ 
Biochenu MoL Biol Int 32: 1059-1066 (1994)] and ballistic me&ods [Builcfaolder et al.. J. 
Immunol Meth, 165: 149-156 (1993)] may also be considered for cell ^es refiactory to 
transfection by fhese other methods. A DNA vector transporter may be considered [see, eg., 
5 Vfnei al., 1992, J. Biol. Chcm. 267:963-967; Wu and Wii, 1988, J. Biol. Chem. 263:14621- 
14624; HartmuteroiL, CanadianPatent Application No. 2,012^11, filed March 15, 1990]. 

In VUro Transfectian Wifli HCV Variants 
Ident^caiion of cell Unes stq^porting HCV replication. An important aspect of the 

10 invention is a method it inx)vides for derolc^ring new and more efi^^ 

confixing the ability to evaluate the efficacy of dififiexent thoapeutic strategies using an 
authentic and standardized in vitro HCV variant replication sfystem. Such assays are 
invaluable before moving on to trials using rare and valuable experimental animals, such as 
the chimpanzee, or HCV-infected human patients. The adaptive variants of the invention aie 

1 5 particulariy usefiil for ibis work because their growth in culture and their ability to withstand 
subpassage is superior to wild-type strains. Also, the replicons disclosed herein are useful 
because replication can be evaluated without the confounding effects of £he structural 
proteins. 

The HCV variant infectious clone technology can also be used to establish in vitro 
20 and in vivo systems for analysis of HCV replication and packaging. These include, but are 
not restricted to, (i) identification or selection of permissive cell types (for RNA replication, 
virion assembly and release); (ii) investigation of cell culture parameters (&g., varying culture 
conditions, cell activation, etc.) or selection of adaptive mutations ttat inorease the efficiency 
of HCV tq>lication in cell cultures; and (iii) definition of conditions for efficioit production 
' 25 ofiiifectious HCV variant particles (either released into the culture supmmtm 

after cell disruption). These and other readily apparent extmsionsofflie invention have bn>ad 
utility for HCV tiienqpeiitic^ vacciiie, and dt 

GeDerdq)proachesfo identifying pennissiveceU types are oull^ Optimal 
methods for KNA transfection (see also, stq^ra) vary wifli cell type and are determmed using 
30 SNA rqxmer constructs. These iiichide, for exanq>le,fliebicistroiiicrq)licons disclosed 
siqfra and in the Examples, and bicistronic virus [Wang et al, J. Virol 67: 3338-44 (1993)] 
with the structure 5'<!AT-HCVIRES-L(JC-3'. These HCV variants are used both to 
optimize transfecticm conditions (using, e.g., by measuring p-galactosidase or CAT 
[chloranqthenicol acetyhransferase] activity to determine trar&sfection efficiency) and to 
35 determine if tiie cell type is permissive for HCV IRES-mediated translation (e.g., by 
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measuring LUC; luciferase activity). For actual HCV RNA transfection e^qperiments, 
cotransfection Ynih a 5' capped lucifecase reporter RNA [Wang et aL, (1993) supra] provides 
an internal standard for productive transfection and translation. Examples of cell types 
potentially permissive for HCV iqilication include, but are not restricted to, primaiy human 

5 cells (e.g., hepatocytes, T-cells, B-cells, foreskin fibroblasts) as well as continuous human cell 
lines (eg., HepG2, Huh7, HOT78, HPB-Ma, MT-l, MT-IC, and other HTLV-1 and HTLV-H 
infected T-K^U lines, Namalawa,Daudi,EBV-transformedIXXs^ In addition, cell lines of 
other species, especially Aose ^ch are readily transfiscted widi KNA and permissive f or 
replication of flaviviruses or pestiviroses (eg., SW-13, Vero, BHK-21, COS, PK-IS, MBCK, 

10 etc.)» can be tested. Cells are transfected using a melhod as described Afi^pra. 

For rq;>Iication assays, RNA transcripts are prepared using the HCV variant and the 
corresponding non-fimctioDal, eg., AGDD (see Examples) derivative as a negative control, 
for persistence ofHCV RNA and antigm in die absence of productive repU TemplatB 
DNA (which comphcates later anatyses) is removed by repeated cycles of DNasel treatment 

IS and acid phenol extraction followed by purification by either gel electrophoresis or gel 

filtration, to preferably achieve less than one molecule of amplifiable DNA per 10^ molecules 
of transcript RNA. DNA-fi:ee RNA traiiscripts axe mixed with LUC reporter RNA and used 
to transfect cell cultures usmg optimal conditions determined above. After recovery of the 
cells, RNaseA is added to the media to digest excess input RNA and &e cultures incubated 

20 for various periods of time. An early timepoint (^^1 day post-transfection) will be harvested 
and analyzed for LUC activity (to verify productive transfection) and positive-strand RNA 
levels in the cells and siq)eniatant (as a baseline). San]ples are collected periodically for 2-3 
weeks and assayed for positive-strand RNA levels by QC-RT/PCR [see Kolyldialov et al., 
(1996) stq)ra]. Cell types showing a clear and reproducible difiereoce between the intact 

25 infectious transcript and tiie nonrfunctional derivative, e,g., AGDD deletion^ control can be 
subjected to more thorough analyses to vmfy authentic replicatiorL Such assays include 
measurem«it of negative-sense HCV RNA accumulation by QC-RT/PCR [Gunji et aL, 
(1994) supra\ Lanford et aL, Virology 202: 606-14 (1994)], Northem-blot hybridiTation, or 
metabolic labeling [Yoo et aL, (1995) siqrra] and smgle cell metiiods, such as in sUu 

30 iQTbridization [ISH; Gowans et aL, In "Nucleic Acid Probes" (R. EL Symons, Eds.), Vol. pp. 
139-158. CRC Piess» Boca Raton. (1989)], in situ FCR [foUowed by ISH to detect only HCV- 
specific anqilificationittoducts; Haase dL, Proc Natl Acad. ScL USA 87: 4971-4975 
(1990)], and inununohistochenusfxy. 

HCVparticlesforstud^gviruS'recqjtor interactions. In combination with flie 

35 identification of cell lines ftat axe permissive for HCV replication, defined HCV variant 
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stocks can be used to evalimte file inteniction of fheHCVm Assays can 

be set up which measure binding of fhe virus to suscqptible cells or productive infection, and 
&en used to sraeoa for inhibitors of tfiese processes. 

IdentiJicaHon of ceU lines for characterizati^ Cell lines 

5 permissive for HCV RNA rq>lication, as assayed by KNA transfection, can be screened for 
their ability to be infepted by die vinis using the HCV variants of fhe pr^ Cell 
lines permissive for KNA icplicatioii but which cannot be infected by the homologous virus 
may lack one or more host recqptors required for HCV binding and oitry. Such cells provide 
valuable tools for Q fimctional identification and molecular cloning of HCV recq;ytors and 

10 co-iecq>tQrs; (ii) characterization of virus-recq>tor interactions; and (iii) developing assays to 
screen for compounds or biologies {eg., antibodies, SELEX KNAs (Bartel and Szostak, In 
"RNA-piotein interactions" (JL Nagai and L W. Mattaj, Eds.), VoL pp. 82-102. IRL Press, 
Oxford (1995); Gold ei al., Anna. Uev. Biodienu 64: 763-797 (1995)], etc.) that inhibit Aese 
interactions. Once defined in this mauna-, Hiesc HCVrecq)tors serve not only as therapeutic 

1 5 targets but may also be expressed in transgenic animals rendmng dsem susceptible to HCV 
infection [Koike et oL, Dev Biol Stand 78: 101-7 (1993); Ren and Racaniello, J Virol 66: 
296-304(1992)]. Such transgenic animd models siipporting HCV rqpUcation and spr^ 
have important applications for evaluating anti-HCV drugs. 

The ability to manipulate the HCV glycoprotein structure inay also be used to create 

20 HCV variants with altered receptor sjpeci&<nty. In one example, HCV glycoproteins can be 
modified to express a heterologous binding domain for a known odlsur&cerecqy^ The 
approach should allow the engineermg of HCV derrvatrves with altered tropism and peihaps 
extend infection to non-chimeric small animal models. 

AUernaHvecgpproachesforidentily^ As previously discussed, 

25 and as exemplified in flieBxainples, fimctional HCY variants c^ 

selectable markers for HCV Tq>licaticiL For instance, gqoes encoding dominant selectable 
markers can be e?q)ressed as part of the HCV polyprotem, or as sqiarate cistrons located in 
perinissive regions of die HCV SNA j^nome. 

30 Animal Models for HCV hifection and Replication 

In addition to chimpanzees, the present inventim permits develqpment of alternative 
animal models for studying HCV replicatiGn and evaluating novel therapeutics. Usfaig clones 
of the authentic HCV variants desoibed in this invention as starting material, multiple 
approaches can be envisioned for establishing alternative animal models for HCV replication. 

35 In one manifestation, the variants could be used to inoculate immunodeficient mice harboring 
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human tissues capable of siq>portixig HCV replication. An exanq>le of this art is the SCIDrHu 
mouse» where mice wi& a severe combined immunodeficiency are mgrafied with various 
human (or chimpanzee) tissues, which could include^ but are not limited to^ fetal liver, adult 
liver, spleen, or peripheral blood mmionuclear cells. Besides SCED mice, normal irradiated 

S mice can serve as recipients for mgraffanent of human or chimpanzee These chimeric 

animals would then be substrates for HCV replication after either ex vivo or in vivo infection 
with defined virus-containing inocuhu 

In another manifestation, adaptive mutations allowing HCV replication in alternative 
qyedes may produce variants that are pennissive for rq>Uc8tiQn in For 

10 instance, adsqpftation of HCV for rqilication and spread in either continuous rodoit cell lines 
or prinuoy tissues (such as hqiatOGytes) could enable flie vims to replicate in small rodent 
models. Alternatively, complex libraries of HCV variants created by DNA shuflBing 
[Stemmer, Proa NatL Acad. ScL USA 91:10747 (1994)] or o&er methods known in the art 
can be created and used for inoculation of potentially suscq)tible animals. Such flnimalfg 

15 could be either immunocompetent or immunodeficient, as desmbed above. 

Ihe functional activity of HCV variants can be evaluated transgenically. In this 
respect, a transgenic mouse model can be used [see, e.g.^ Wihnut et al,, Experientia 47:905 
. (1991)]. The HCV RNA or DNA clone can be used to prepare transgaiic vectors, includii^ 
viral vectors, plasmid or cosmid clones (or phage clones). Cosmids may be introduced into 

20 transgenic mice using published procedures [Jaenisch, Science^ 240:1468-1474 (1988)]. In 
fbG preparation of transgenic mice, embryonic stein cells are obtained from blastocyst 
embryos [Joyner, Jn Gene Targeting: A Practical Approach. The Practical Approach Series^ 
Rickwood, D., and Hames, B. D., Eds., IRL Press: Oxford (1993)] and transfeded with HCV 
variant DMA or SNA. Ihtnsfectedcelk are iigectedinto early embryos, e.g., mouse 

25 embryos, as described [Hammer et aL, Nature 315:680 (1985); Joyner, ^pra]. Various 
techniques for prqiaration of transgenic animals have been desmbed [U.S. Patent No. 
5,530,177, issued June 25, 1996; U.S. Patent No. 5,898,604, issued December 31, 1996]. Of 
particular Interest are transgenic ahimal models in which the phraotypic or paAogenic effects 
of a transgene are studied. For exanq>le,flie effects of a nrt;phosphoeno^>ynivate 

30 caxboxykinase-bovine growth hormone fusion gene has been studied in pigs [Wiegjiart et aL , 
J. Reprod Pert., SuppL 41:89-96 (1996)]. Transg^c mice tibat express of a gene encodmg a 
human amyloid precursor protein associated with Al2lieimer's disease are used to study this 
disease and other disc^ders [International Patent Publication WO 96/06927, published March 
7, 1996; Quon et al.. Nature 352:239 (1991)]. Transgoiic mice have also be^ created for flie 

35 hepatitis delta agent [Polo et al.. J. Virol 69:5203 (1995)] and for hq>atitis B virus [CSiisari, 
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Curr. Top. Microbiol Immunol 206:149 (1996)], and rq)lication occurs in tfiese engmeeied 
ammals. 

Thus, file functional HCV variants described here, or parts hereof, can be used to 
create transgenic models relevant to HCV replication and pathogenesis. In one example, 

S transgenic animals harboring the mttre genome of an HCV varim^ 

Appropriate constructs for transgenic expression of the entire HCV variant genome in a 
transgenic mouse of the invmtion could include a nuclear promoter engineered to produce 
transcripts with the appropriate 5' terminus, the full-length HCV variant cDNA sequence, a 
cis-cleaving delta ribozyme [Ball,/. Virol 66: 2335-2345 (1992); Pattnaikc^fli. Cdl 69: 

10 1011-1020 (1992)] to produce an authentic 3' terminus, followed possibly by signals &at 
promote proper nuclear processing and transport to the cytoplasm (where HCV RNA 
replication occurs). Besides tiie oitire HCV variant genome, animals can be engineered to 
express individual or various combinations of HCV proteins and RNA elonents. For 
example, animals engine^ed to ^qyress an HCV gene product or rqjorter gene xmdCT the 

1 5 control of the HCV IRES can be used to evaluate therapies directed against &is specific RNA 
target Similar animal models can be envisioned for most known HCV targets. 

Such alternative animal models are useful for (i) studying the efTects of difierent 
antiviral agents on replication of HCV variants, including replicons, in a wtole animal 
system; (ii) examining potential direct cytotoxic effects of HCV gene products on hepatocytes 

20 and other cell types, defining the underlying mecbanisms involved, and identifying and 
testing strategies for therapeutic intervention; and (iii) studying immune-mediated 
mechanisms of cell and tissue damage relevant to HCV patiiogotiesis and identifying and 
testing strategies for interferii^ witii tiiese processes. 

25 Selection and Analysis of Dn^-Bcsistant Variants 

Cell lines and animal models suppOTtmg HCV rq)lication can be used to examine llie 
emergence of HCV variants wifli resistance to eadsting and novel therapeutics. LikeallRNA 
viruses, the HCV replicase is pnesumed to lack proofireading activity and RNA replication is 
tfamfoieenpor prone, giving rise to a faig^ level ofvariationOBukh 6^^^^ Tlie 

30 variability manifests itself in tiie infected patient over time and in tiie oonsideratble diversity 
observed between different isolates. The emergence ofdrug-resistant variants is likely to be 
an inqnyrtantconsideratiim in tbfi design and evaluation of HC^ mono and condrination 
therapies. HCV replication systems oftiieinv^tion can be used to study the emergence of 
variants under various therapeutic fmmulations. These might include monother^y or various 

35 combiimtion therapies (eg:, IFN-<x,ribaviriii, and new antivnalcoii^ Resistant 
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mutants can Hkca be used to define Ifae molecular and structural basis of resistance and to 
evaluate new therapeutic fimnulations, or in screening assays for effective anti-HCV drugs 
iinfra). 

' 5 Screoung For Anti-HCV Agents 

HCV-permissive cell lines or animal models (preferably rodent models) comprising 
adaptive HCV variants can be used to smen for novel ixihibitars or to evaluate candidate anti- 
HCV &0rapies. Such flifflapies include, but would not be limited to, (i) antisense 
oligonucleotides or ribozymes tadgeted to conserved HCV SNA targets; (iO injectable 

10 compounds capable of inhibiting HCV rq)licatiQn; and (hi) anSiy bioavailable compounds 
capable of inhibiting HCV replication. Targets f<x such formulations include, but are not 
restricted to, (i) conserved HCV SNA elements inqxntant for KNA rqdication and SNA 
packaging; (iO HCV-encoded ens^e^ (iu) protein-protein and protein-SNA interactions 
important for HCV SNA replication, virus assembly, virus release, viral receptor binding, 

1 5 viral entcy, and niitiation of viial SNA replication; (iv) vnus-host intmctions modulating Hie 
ability of HCV to establish chronic infections; (v) virus-host interactions modulating the 
severity of liver damage, including factors affecting apoptosis and hqpatotoxicity; (vi) virus- 
host interactions leading to the development of more severe clinical outcomes including 
cinbosis and hepatocellular carcinoma and (vii) virus-host interactions resulting in other, less 

20 frequent, HCV-associated human diseases. 

Evaluation of antisense and ribo2yme therapies. The i)resent invention extoids to the 
preparation of antis^ise nucleotides and ribozymes &at may be tested for the ability to 
interf^ with HCV replication. This approach utilizes antisense nucleic acid and ribozymes 
to block translation of a q)ecific mSNA, d&er by masking &at mSNA with an antisense 

25 nucleic acid or cleaving it with a ribozyme. 

Antisense nucleic acids are DNA or SNA molecules that are complementaiy to at 
least a portion of a specific mSNA molecule. Reviews of antisense technology include: 
Baertscbi, Mol Cell Endocrinol 10i:RlS-R24 (1994); Crooke et aL, AnmL Rev, Pharmacol 
Toxicol 36:107-129 (1996); Alama et aL, Pharmacol Res. 36:171-178; and Boyer et al., /. 

30 Hepatol 32(1 Sappl):98-1 12(2000). The last review discusses antisense technology as it 
applies to HCV. 

In the cell, Ibey hybridize to that niSNA, finming a double stranded DNA:SNA or 
SNA:SNA molecule. IlieceU does not translate an mSNA in flus double-stranded form. 
Therdbre, antisense nucleic acids interfisre with flie expression of mSN A into protein. 
35 Oligomm of about fifteen nucleotides and molecules that hybridize to the AUG initiation 
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codan will be particularly efScien^ since Ibey are easy to synfliesize and are likely to pose 
fewer problems dm lai:ger molecules ^0riienintn>duci]^ Antisense 
methods have been used to inhibit file expression of maxiy genes in vi'^ Preferably synthetic 
antisense nucleotides contain pho^hoester analogs, such as phosphorothiolates, or fhioesters, 

5 ralh^fiian natural phophoester bonds. Such phosphoester bond analogs are more resistant to 
degradation* increasing the stabflity, and Iherefore &e efficacy, of the antisense nucleic acids. 

In the genetic antisense approach, eqxressimi of flie wild-type allele is suppressed 
because of repression of antisoQseRNA. Thtetedmique has been used to inhibit TK 
synttiesis in tissue culture and to jsoduce phenotypes of the Kng>pel mutation in DrosophUa^ 

10 and the Shiverer mutation in mice [Izant ei aL, CeU^ 36:1007-1015 (1984); Green ei oL, Anmu 
Rey. Biochem., 55:369-597 (1986); Katsuki et aL, &ience, 241:593-595 (1988)]. An 
important advanb^ of this iq[q[)road& 

ejqjressed for dffectiye inhibition of e3q)ressiQn of the entire cc^^ Theantisense 
transgene will be placed under control of its own promoter or anodio: promoter oqiressed in 

15 flie correct cell type* and placed ij^stream of the SV40 polyA site. 

Ribozymes are EKA molecules possessing die ability to specifically cleave other 
single stranded SNA molecules in a manner somewhat analogous to DNA restriction 
endonucleases. Ribozymes were discovered fiom the observation that certain mRNAs have 
the ability to CTcise their own introns. By modifying the nucleotide sequence of these KNAs, 

20 researchers have been able to oigineer molecules that recognize specific nucleotide sequences 
in an RNA molecule and cleave it Recent reviews include Shippy et al., Mol BioiechnoL 
12:117-129 (1999); Schmidt, M>1 CeZfa 9:459-463 (1999);Pltylactouetal.,Mctfc. EnzymoL 
313:485-506 (2000); Oketani et aU J. HepatoL 31:628-634 (1999); Macejak et al., 
Hepatology 31:769-776 (2000). The last two refaoices disclose the use of ribozymes for 

25 inhibiting HCV. Becai&elhey are sequeaKe^qpecific, only mRNAs wi& 
are inactrvated. 

InvestigatorB have identified two types of xibo^mes, Tetnxhym&ta-typc and 
"hamma:head"-1ype. TeirakymenaAypcribo^^ 

"hammerfaead"-type recognize eleven- to eig^teen-base sequences. The longer the 
30 recognition sequence, the more likely it is to occur exclusively in the tax^ mRNA species. 

Therefore, hamineiiieadrlyperibozynies areprefie^ to Tetrahymena-typG ribozymes for 

inactivating a spedfic mRNA species, and dg^bfteeai base recognition sequences are preferable 

to shorter recognition sequences. 

Screening con^poundUbrwiesfw'anti-HC^ Various natural product or 

35 synthetic lilxcaries can be screened for and-HCV activity in in vitro or in vrvo models 
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comprising HCV variants as provided by ttie invention. One approach to preparation of a 
combinatorial library uses primarily chemical methods^ of which the Geysen method [Geysen 
et al.. Molecular Immunology 23:709-715 (1986); Geysen et alJ. Immunologic Method 
102:259-274 (1987)] and the method of Fodor et al.[Science 251:767-773 (1991)] are 
S examples. Fuxka et aL[14ih Intematioiud Congress of Biochemistry, Volume 5, Abstract 
FR:013 (1988); Furka^/nt 7. Peptide Protein Res. 37:487-493 (1991)]> Houghton [U.S. 
Patent No. 4,631^1 1, issued December 1986] and Rutter et fl/.[U.S. Patent No. 5,010.175, 
issued April 23, 1991] describe melhods to produce a mixture ofpeptides&at can be tested 
for anti-HCV activity. 

10 Inanolher aspect, synthetic libraries [Needels e/a/.« Proc. NatL Acad. Sd. USA 

90:10700-4 (1993); Ohhneyer et aL, Proa NatL Acad Sd. USA 90:10922-10926 (1993); Lam 
et aL, International Patent Publication No. WO 92/0025% Kocis et al., lutemational Patent 
Publication No. WO 9428028], and flie like can be used to screen for anti-HCV compounds 
according to the present invention. Tbe references describe adaption ofthe library screening 

15 techniques in biological ass^. 

Defined/engineered HCVvariant virus particles for neutralizatio The 
variants described herein can be used to produce defined stocks of HCV particles for 
infectivity and neutralization assays. Homogeneous stocks can be produced in the 
chimpanzee model, in cell culture systems, or using various heterologous eTcpression systems 

20 (eLg.,baculovirus, yeast, xnainmalian cells; see jrt^pni). These stocks can be used in cell 
culture or in vivo assays to define molecules or gene tfaer;^ approaches capable of 
neutralizing HCV particle production or infectivity. Examples of such molecules include, but 
are not restricted to» polyclonal antibodies, monoclonal antibodies, artificial antibodies with 
engineered/optimized specificity, singleH:hain antibodies (see tiie section on antibodies, 

25 infra), nucleic acids or derivatized nucleic acids selected for specific binding and 

neutralization, small orally bioavailable compounds, etc. Such neutralizing agents, taigeted to 
conserved viral or cellular targets, can be dtiier goiotype or isolate-spedfic or broadly cross- 
reactive. Ibey could be used eiflierprophylacticaUy or for x)assiveimmunother^ 
viral load and perh^s increase tiie chances of more effective treatment in combination with 

30 other antivird agents (eg., IFN-Oy ribavirin, etc.). Directed manipulation of HCV infectious 
clones can also be used to produce HCV stocks witii defined changes in the gtycoprotein 
faypervariable regions cht in o&er epitopes to stuc^ mechanisms of antibody neutralization, 
CFL recognition, immune escqie and immune enhancement These studies will lead to 
identification of other vinis-qiecific fimctions for anti-viral therapy. 

35 
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Dissection of HCV Replication 
OAer HCV replication assays. This invention allows directed molecular genetic 
dissection of HCV replication. Such analyses are expected to (i) validate antiviral targets 
which are currentiy being pursued; and (ii) uncover uneaqpected new aspects of HCV 

5 replication amenable to therapeutic intervention. Targets for immediate validation tiirough 
mutagenesis studies include the following: tiie 5' NIR, tiie HCV polyprotein and clea^ge 
products, and tiie 3' NTR. As described above, analyses using the HCV variants and 
pennissive cell cultures can be used to compare parental and mutant replication phenotypes 
after trans&ctionofceU cultures witii infectious RNA. Even though RT-PCR allows 

10 sensitive detection of viral RNA accumulation, niutations which decrease tiie efiSdency of 
RNAr^Ucation may be di£Scult to analyze, unless conditional mutations are r^ Asa 
conqilement to first cycle analyses, l!raits-conQ)lementation assays can be used to fecilitate 
analysis of HCV mutant phenotypes and inhibitor screening. Chimeric variants comprising 
portions of het^logous systems (vaccinia, Sindbis, or non-viral) can be used to drive 

15 expression of the HCV RNA repUcase proteins and/or packaging machine [see Lenmi and 
Rice, y. ViroL 67: 1905-1915 (1993a); Lcmm and Rice, /. ViroL 67: 1916-1926 (1993b); 
Lemmc/a/.,£WBOJ. 13: 2925-2934 (1994); Li a/., 7. Fzroi 65:6714^723(1991)]. If 
tiiese elements are capMc of functioning in trans, then co-expression of RNAs witii 
appropriate m-elements should result in RNA replication/packaging. Such systems therefore 

20 mimic steps in auflientic RNA repUcation and virion assembly, but imcouple production of 
viral components firom HCV rq)lication. If HCV replication is somdiow self-limiting, 
heterologous systems may drive significantiy higher levels of UNA replication or particle 
production, &cilitating analysis of mutant phenotypes and antiviral screening. A tinrd 
approach is to devise ceD-fiee systems for HCV template-<]q)endent RNA re^ A 

25 coiq>led translation/replication and assembly system ha& been described for poUovirus in 
HeLa cells [Barton and Flanegan, J. VmL 67: 822-831 (1993); MoUa et aL, Science 254: 
1647-1651 (1991)], and a tenq>lat&<lependeat ui vUro assay for initiation of negative-strand 
synthesis has been establisliedfiir Sindbis virus. Suziilar in virsno systems using HCV variants 
are invaluable for studying many aspects of HCV replication as well as for inhibitor screening 

30 and evaluation. An example ofeadboftihese strategies follows. 

Trans-coniplemeniationcfHCVJRNA replication and/or packaging using viral or 
non-viral expressim systems. Heterologous systems can be used to drive HCV rq>lication. 
For example, tiie vaccmia/T7 cytoplasmic e)qiression system has been exlrmety useful for 
trans-conq>lementation of RNA virus rq>licase and packaging functions [see Ball, (1992) 

35 siqpra; Lemm and Rice, (1993a) supra; Lemm and Rice, (1993b) supra; Lemm et dL, (1994) 
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suprai Pattnaik et al, (1992) w^pra; Patliiaik et al., Virology 206: 760-4 (1995); Porter cT a/., 
J. Virol 69: 1S48-1SSS (1995)]. In faridT, a vaocinia recombinant (vTF7-3) is used to express 
T7KNA polymerase (TTRNApol) in Ihe cell type of intere^ Target cDNAs, positioned 
downstream fixnn the T7 promoter, are deliveied either as vaccinia recombinants or by 

5 plasmid transfectiorL This system leads to high level KNA and protdn expression. A 
variation of fliis approach, vitaixAi obviates the need for vaccinia (^ch could interfere widi 
HCV KNA replication <»: virion fonpationX is the pT7T7 system where Ihe T7 promoter 
drives expression of TTRNApol [€3iea et oL, Nudeic Acids JRes. 22: 21 14-2120. (1994)]. 
pTJTJ is mixed with T7E(NApol (the protein) and co-transfected wifli the T7-driven target 

10 plasmid of intoest Added T7RN^)ol initiates transcription, leadiiig to it own producti 
and high level expression ofthe target gene. Using dtfaorappiDachyRNA transcripts of 
variants with precise 5' and 3' tenniiu can be produced usmg die T7 transcri^ 
and tbe cis-cleaving HCV ribozyme (Rz) (30 [Ball, (1992) supm; Pattnaik ^ oL, (1992) 
jn^ra]. 

1 5 These or similar e?qnession systems can be used to establish assays for HCV RNA 

replication and particle formation using HCV variants, and for evaluation of cooQiouiids 
which might inhibit diese processes. T7-driven protein expression constructs and fidl4engfh 
HCV varimts incorporating die HCV ribo2yme following the 3' A 
typical experimental plaii to validate the assay as described for pT7T7, aldioug^ essentially 

20 similar assays can be envisioned using vTF7-3 or cell Unes expressing the T7 KNA 
polymerase. HCV-permissive cells are co-transfected with 

pT7T7+T7RNApol4p90/HCVFLlorig pU Rz (or a negative control, such as AGDD). At 
different times post-transfection, accumulation of HCV proteins and SNAs, driven by the 
p 17 17 system, are followed by Westem and Northem blotting, respectively. Toassayfor 

25 HCV-specific replicase function, actinomycin D is added to block DNA-<lq)endent T7 

transcription [Lenun and Rice, (1993a), stqird] and actinomycin D^esistant RNA synthesis is 
monitored by metabolic labeling. Radioactivity will be incorporated into full-length HCV 
KNAs for p90/HCVFL long pU/Rz, but not for p90/HCVFLAGDD/E^ Using HCV variants 
of the invention, this assay system, or elaborated derivatives, canbe \ised to screen for 

30 inhibitors and to stuc^ tiieir effects on HCV RNA replication. 

CdUfree system for assaying HCV rq>lication and Cell-fiee 
assays for studying HCV RNA replication and inhibitor screening can also be established 
using the variants described in this invention. Eiflier virion or transcribed RNAs are used as 
substrate RNA. For HCV, fulHengtii HCV variant RNAs transcribed m vitro can 

3 5 program such in vitro systems and replication assayed essentially as described for poliovirus 
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[see Barton et al, (1995) supra], bi case hepatocyte-specific or othor fectors are required for 
HCV variant RNA leplication, the system can be sqiplmenled with hepatocyte or other cell 
extracts, or alternatively, a comparable system can be established usmg cell lines which have 
been shown to be permissive for replication of flie HCV variants. 

S One concern about this qyproach is fbaX proper cell-fiee synfliesis and processing of 

the HCV polyprotein must occur. Sufficimt quantities of properly processed rqilicase 
conq>onents may be di£Bicutt to produce. To circumvent this problem, the T7 e3q>ression 
system can be used to express hig^ levels of HCV replicase coniponents in appropriate cells 
[see Lemm et al, (1997) ^ifira]. PIS membrane fractions from fliese cells (with added 

1 0 buffer, Mg^, an ATP regenerating system, and NIPs) diould be able to initiate and 

syn&esize fidl-lengdi npgative-strand BNAs upon addition of HCV-«pedfic template KNAlS. 

Establidmimt of either or bo& of fte above assacys aUows rapid and 
of the effects of HCV mutations, host fiKtm, involved in replication and inhibitors of flie 
various steps in HCV RNA replication. These systems will also establish the requirements 

IS for helper systems fi9rprq>artngrq[>UcatiQnTdefident HCV vecte 

Vaccination and Protective Immunity 
There are still many unknown parameters fliat impact on development of effective HCV 
vaccines. It is clear in both man and the chinQ>an2ee that some individuals can clear the 

20 iofection. Also, 10-20% of those treated wi&IFN or about twice this percentage treated wi 
IFN and ribavirin show a sustained refuse as evidenced by lack of circulating HCV RNA. 
O&CT studies have shown a lack of {HDtective immimity, as evidenced by successful 
reinfection wi& bo& homologous virus as well as widi more distantly related HCV types 
[Faici et al., (1992) suprar^ Prince et al, (1992) stqn-a]. Nonetheless, chimpanzees immunized 

2S with subunit vaccines consisting of E1E2 oligomers and vaccinia recombmants expi^^ssmg 
these proteins are partially protected against low dose challenges [Choo et al., Proc Nail. 
Acad. ScL USA 91:1294 (1994)]. Ibe HCV variant technology described in tibis invention has 
utility not only for basic studies aimed at understanding &e nature of protective immune 
responses against HCV, but also fixr novel vaccine production methods. 

30 Active immunity against HCV can be induced by immunization (vaccination) with an 

immunogenic amount of an attenuated or inactivated HCV variant virion, or HCV virus 
particle protrins» preferably wift an immunologically effective a^^ An 
"immunologically effective adjuvant" is a material tiiat enhances the immune response. 
Selection ofanaiSuvantdq)end8 cm tbe subject to be vacci^ Preferably,a 

3S pharmaceuticallyaccqitablea^uvantisused. For example, a vaccine for a human should 
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avoid oil or bydrocarbon raiulsion adjuvants, including complete and mconq)lete Freund's 
adjuvant One example of an adjuvant suitid>le for use with huniam is alum (alumm A 
vaccine for an animal, however, may contain adjuvants not ^appropriate for use with humans . 
An alternative to a traditional vacdne comprising an antigen and an adjuvant involves 
5 the direct in vfvo introduction of DNA or RNA encoding tiie antigen into tissues of a subject 
for expression ofthe antigen by tile cells of&esubjed^s tissue. Such vaccines are termed 
herein genetic vaccines, DNA vaccines, genetic vaccination, or nucldc acid-based vaccines. 
Mediods of transfection as described above, such as DNA vectors or vector transporters, can 
be used for DNA vaccines. 

10 DNA vaccines are desmbed, e.g^ in Inteniational Patent Publication WO 95/20660 

and Intmnational Patent Publication WO 93/19183, the disclosures of which are herdiy 
iiicoiporatedl^^ reference in their entiled Theabilityofdirecdy injected DNA that 
encodes a viral protein or genome to elicit a protective immune response has been 
demonstrated in numCTOus expoimiental systems [Conry et oL, Cancer Res,, 54:1164-1168 

15 (1994); Cox et oL, nroU 67:5664-5667 (1993); Davis et oL, Hum. Mole. Genet, 2:1847-1851 
(1993); Sedegah et aL, Proa Natl Acad. ScL, 91:9866-9870 (1994); Montgomery et al., DNA 
Cell Bio., 12:777-783 (1993); Uhner et aL, Science, 259:1745-1749 (1993); Wang et al, 
Proc. Natl Acad. Sci., 90:4156-4160 (1993); Xiang et al.. Virology, 199:132-140 (1994)]. 
Studies to assess ttiis strategy in neutralization of influenza virus have used both envelope and 

20 internal viral proteins to induce the production of antibodies, but in particular have focused on 
the viral hemagglutinin protein (OA) [Fynan et al, DNA Cell Biol, 12:785-789 (1993 A); 
Fynan et al., Proc. NatL Acad ScL, 90: 1 1478-1 1482 (1993B); Robinson et oL, Vaccine, 
11:957, (1993); Webster cTo/L, Vaccine, 12:1495-1498 (1994)]. 

Vaccination fliroug^ directly inj ecting DNA or RNA fliat encodes a ]m>tein to elicit a 

25 protective inmiuneieq)onsepn)duces both ceU-mediated and humoral This is 

analogous to results obtained wifli live viruses [Raz et al, Proc Natl Acad. ScL, 91:95 19- 
9523 (1994); Uhnw, 1993, st^ra; Wang, 1993, supra; Xiang, 1994, stq^ra]. Studies witti 
feiiets indicate that DNA vaccines against conserved intonal viral proteins of influenza, 
together wifli sur&ce glycoproteins, are more effective against antigenic variants of influenza 

30 virus than are either inactivated or subviriou' vaccines [Donnelly et al, NatJ^edidne, 6:583- 
587(1995)]. hideed, reproducible insnnme responses to DNA encodiiigni^^ 
been reported in mice ttat last essentially for the lifetnne of tiheanii^ [Yankauckasef ail, 
DNA CellBioL, 12: 771-776 (1993)]. 

A vaccine of the invention can be administered via any parenteral route, including but 

35 not limited to intramuscular, intraperitoneal, intravenous, intraarterial (eg:, Ripatic artery) 
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and die like. Piefmbly, since the desired resdt of vacciimtim 
response to HCV, administration direcdy, or by targeting or cboice of a viral vector, 
indirectly, to lymphoid tissues, eg., fymphnodes or spleen. Since immune cells are 
continually replicating;, they are ideal target for retroviral vector-based nucleic acid vaccines, 

5 since retn>viruses require replicating cells. 

Passive immunity can be confened to an animal subj ect suspected of suffering an 
infection with HCV by administering antiserum, neutralizing polyclonal antibodies, or a 
neutraliang monoclonal antibody against HCV to the patient Although passive immunity 
does not confo: long-term protection, it can be a valuable tool for tiie tieatmaxt of an acute 

10 infection of a subject who has not been vaccinated. Preferably, the antibodies administered 
lor passive unmune ti»iqyy are autologous antibodies. For example, iftiie subject is a 
human, preferably the antibodies are of human origin or have been "humanized," in order to 
minmm tfift posariKilffy afsm immiine tesponse agaitist the antibodies. In addition, genes 
encoding neutralizmg antibodies can be introduced in vectors for expression in -mo, e.g. , in 

IS hepatocytes. 

Antibodies for passive imrmmethenqjy. Preferably, HCV variant virions or vans 
particle proteins prepared as described above are used as an immunogen to generate 
antibodies that recognize HCV. The variants utilized should have wild-type coat Such 
antibodies include but are not limited to polyclonal, monoclonal, chimeric, single chain. Fab 

20 fragments, and an Fab expression lihraiy. Various procedures Imown in the art may be used 
for the production of polyclonal antibodies to HCV. For ttie production of antibody, various 
host aTiimalR can be immunized by injection witibi the HCV virions or polypeptide, &g., as 
describe m^/i, including but not limited to rabbits, mice, rats, sheep, g Various 
adjuvants may be used to increase tbe immunological response, depending on the host 

25 species, including but not limited to Freund*s (conq>lete and incomplete), min^ gels such as 
aluminum hydroxide, sur&ce active substances such as lysolecitiiin, pluronic polyols, 
polyanions, peptides, oil ennilsions, keyhole limpet hemocyanins, dinitrophenol, and 
potentially useful human adjuvants such as BCQ (padlle Calmette-Guerin) mi 
Corynebactenum parvum. 

30 For preparation ofmonoclond antibodies directed toward HCV as described aboi^ 

any technique that provides for the production of antibody molecules by continuous cell lines 
in culture may be used, lliese include but are not limited to the faybridoma technique 
originally developed by KoUer and Milstem {Nature 256:495-497 (1975)], as well as the 
trioma technique, the human B-cell hybridoma technique [Kozbor et aL^ Immunology Today 

35 4:72 1983); Cote et oL, Proa Natl Acad, ScL U,SjL 80:2026-2030 (1983)], and die EBV- 
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hybridoma technique to produce human mosioclonal antibodies [Cole eiaL,m Monoclonal 
Antibodies and Cancer Thengjy, Alan IL Liss» Inc., pp. 77-96 (198S)]. In an additional 
embodiment of the invention, monoclonal antibodies can be produced in germ-fiee animals 
(Ihtmiational Patent Publication No. WO 89/12690, published 28 December 1989]. In &ct, 

5 according to the invention, techniques developed for tihe production of "chimeric antibodies** 
(Morrison et al., J. BacterioL 159:870 (1984); Neubergo: et al.. Nature 312:604-608 (1984); 
Takeda et aL, Nature 314:452-454 (1985)] by slicing tibe genes from a mouse antibody 
molecule specific for HCV together with g^es fiom a human antibo^ mol ecule of 
appropriate biological activity can be used; such antibodies are within the scope of this 

10 invention. Such hmnan or humanized chimeric antibodies are preferred for use in therapy of 
human diseases or disordens (described i/^/hi), since the human or humanized antibodies are 
much less likely ftan xenogemc antibodies to induce an immune lespon 
allergic response, themselves. 

According to the invention, techniques described for the production of single chain 

15 antibodies [U.S. Patent Nos. 5,476,786 and 5,132,405 to Huston; U.S. Patent 4,946,778] can 
be adapted to produce HCV-^pecific sin^e chain antibodies. An additional embodiment of 
the invention utilizes the techniques described for the construction of Fab expression libraries 
(Huse et al„ Science 246: 1275-1281 (1989)] to allow rapid and ea^ identification of 
monoclonal Fab fiagmients with tiie desired specificity. 

20 Antibody fragments containing the idiotype of tiie antibody molecule can be 

generated by known techniques, F<Mr example, such fi:agments include but are not limited to: 
the F(ab02 fi:agment which can be produced by pepsin digestion of the antibody molecule; the 
Fab' firagments which can be generated by reducing the disulfide bridges of tiie F(ab')2 
firagmoit, and the Fab fi:agments "vAidi can be generated by treating tiie antibo(fy molecule 

25 witii papain and a reducing agent 

HCVpartidesfi^subunitvacdnatioTL Ibefunctiorial HCV variants of the present 
invention can be used to pn>duceHCV-likB particles for vaccim Proper g^cosylation, 
folding, and assembly of HCV particles may be inqxirtant for imkhictng appr o pri ately 
anti^nic and protective subunit vaccines. Several metiiods can be used for particle 

30 production. They include engineering ofstableceUliiies for inducible or constitutive 

expression of HCV -like particles (using bacterial, yeast or mammalian cells), or tiie use of 
higher level eukaryotic heterologous oqxression systems such as recombinant baculoviruses, . 
vaccinia viruses {Moss, Proa Natl Acad. ScL U.&A. 93: 1 1341-1 1348 (1996)], or 
alphaviruses {Frolov et aL, (1996) stq^m]. HCV particles for immunization may be purified 

35 fiom eitiier the media or disrupted cells, dq;)endingiqpontiieirlocalizatioii. Suchpurified 
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HCV particles or mixtures of i>aiticles^]^^ genotypes, can be 

injected with our witibout various adjuvants to oihance immunogenidty. 

Infecthnismn-replicatingHCVpar^ la another numifestation, particles of HCV 
variants capable of lecqptor binding, entry, and translation of genome RNA can be produced. 
5 Heterologous expression approaches for production of such particles include, but are not 
restricted to, E. coli^ yeast, or mammalian cell lines, appropriate host cells infected or 
harboring recombinant baculoviruses, recombinant vaccinia viruses, recombinant 
alphaviruses or RNA rqilicons, or recombinant adenoviruses, mgmeered to express 
appropriate HCVRNAs and proteins, bi one example, two recombinant baculoviruses are 
10 engineered. One baculovinis eqnresses the HCV structural proteins (e.g. C-Bl-E2-p7) 
required for assembly of HCV particles. A second reccmibiiiant expresses the ientire HCV 
genome RNA, with precise S' and3' ends, excqit that a deletion, sudb as AGDD or 
ODD-^AAG (see example IX is inchided to inactivate the HCVNSSB RDRP. Other 
mutations abolishing productive HCV. replicaticm could also be utilized instead or in 
IS combination. Cotransfection of appi t ipai ate host cells (SO, SfZl, eto.) wifli both 

recombinants will produce high levels of HCV structural proteins and genome SNA fm* 
packaging into HCV-like particles. Such particles can be produced at hig^ levels, purified, 
and used for vaccination. Once introduced into tiie vaccinee, such particles will exhibit 
normal receptor binding and infection of HCV-susceptible cells. Entry will occur and flie 
20 genome RNA will be translated to iHoduce all ofthe normal HCV antigens, except that 
further replication of tiie genome wiU be completely blocked given the inactivated NS5B 
polymerase. Such particles are expected to eUcit effective CTL responses against stnictural 
and nonstructural HCV protein antigens. This vaccination strategy alone or preferably in 
conjunction with the subunit strategy described above can be used to elicit high levels of both 
25 neutralizing antibodies and CJiX, responses to help clear die virus. A variety of different HCV 
genome RNA sequences can be utilized to ensure broadly cross-reactive and protective 
immune responses. In addition, modification of the HCV particles, either through genetic 
engineoing, or by derivatization in yitrOy could be used to target iafection to cells most 
effective at eliciting protective and long lasting immune responses. 
30 Live--attemiaiedHCVdmvaiives. Tbe ability to manipulate the HCV g^ome RNA 

sequence and Ifaerdiy produce mutants with altered patfaogaaicity provides a means of 
constructing Uve-attenuated HCV variants iqpprcp^ Sudi vaccine 

candidates express protective antigens but would be impaired in tiieir ability to cause disease, 
establish chronic infections, trigger autoimmune responses, and transfimn cells. 
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Additionally, viruses propagated in cell culture fiequontly acquire mutations in their 
RNA genomes that display attenuated phenotypes in vivo, while still retaining their 
immunogenicity. Attenuated virus strains would be impaired in fheir ability to cause disease 
and establish chronic infections. Production of HCV variants adapted for tissue culture may 
5 represent potential candidates for live-attenuated vaccines. An attractive possibility is the 
production of HCV derivatives containiiig the deletion in NSSA described in tiiis application 
ascloneI(seeExanq[>le 1). Such a variant is less likely to revert to wild type in the host 

HCV Variant-based Gene depression Vectors 

10 Some of the same properties of HCV les^ding to chronic liver infection of humans may also be 
of great utility for designing vectors for gene expression in cell culture systems, genetic 
vaccination, and gene tfaenqjy. Ibe HCV variants described herein can be engmeered to 
produce chimeric KNAs designed for the repression of l^terologous gene products QUNAs 
and proteins). Strategies have been described above and elsewhere [Bredoibeek and Rice, 

15 (1992) siq>ra; Frolov et al., (1996) stqnu] and include, but are not limited to (i) in-frame 
fusion of the heterologous coding sequences with die HCV polyprotein; (ii) creation of 
additional cistrons in tiie HCV genome RNA; and.(iii) inclusion of IRES elements to create 
multicistionic self-replicating HCV vector RNAs capable of expressing one or more 
heterologous genes (Figure 2). Functional HCV RNA backbones utilized for such vectors 

20 include, but are not limited to, (i) live-attenuated derivatives capable of replication and 
spread; (ii) RNA replication competent "dead end" derivatives lacking one or more viral 
conqxments (e.g. the stnictucal proteins) required for viral spread; (iii) mutant derivatives 
cqmble of high and low levds of HCV-spedfic RNA synthesis and accumulation; (iv) mutant 
derivatives adq>ted for replication in diSeaxsA human cell type^ (v) engineered or selected 

25 mutant dmvatives capable of proloi^edncHM^ytopaiUcrepUcation in h Vectors 
competent for RNA. rq>lication but not packaging or spread can be introduced either as naked 
RNA»DNA» or packaged into virus-like particles. Such virus-like particles can be produced 
as described above and conqposed of eilber unmodified or altered HCV virion components 
designed for targeted transfixtionoftfaehepatocytes or otiier human ceUlyi^ Alternatively, 

3 0 HCV RNA vectors can be enciq>sidated and deliveied using heterologous viral packaging 
machineries or encapsulated into lq)08omes modified for efiBciCTt gene deU These 
packaging strategies^ and modifications tbereoC can be utilized to efficiently target HCV 
vector RNAs to specific ceQ types. Usmg methods detailed above, similar HCV-dmved 
vector systems, competent for repUcation and expression in other species, can also be derived. 
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Various methods, e.g., as setfatOxsiqfra in connection wifh transaction of cells and 
DNA. vaccines, can be used to intn>duce an HCV vector of the inventim^ Of primary interest 
is direct injection of functional HCV RNA or virions, e.g., in the liver. Targeted ^e 
delivoy is desmbed in International Patent Publication WO 95/28494, published October 

5 1995. Altenmtively,&e vector can be introduced mvn^ by Upofection. For fte past decade, 
diere has been increasing use of liposomes far encapsulation and transfisction of nucleic acids 
in vitro. Synthetic cationic lipids designed to limit tiie difSculties and dangers encountered 
with liposome mediated transfection can be used to prqpare liposomes for in vivo transfection 
of a gene encoding a marker [Feigner, et al., Prac NatL Acad. ScL U.SLA. 84:7413-7417 

10 (1987); see Mackey, etoL, Proc Nad Acad. Set U.SjL 85:8027-8031 (1988); Uhner et al.. 
Science 25!hl74S-*1748 (1993)]. Ihe use of cationic lipids may promote encapsulation of 
negatively charged nucleic acids, and also promote fusion with natively charged cell 
membranes IFelgnor and Ringold, Science 337:387-388 (1989)]. Theuseoflipofectionto 
introduce exogenous genes into the specific organs in vivo has certain practical advantages. 

IS Molecular targeting oftiposomes to specific cells rqiresents<me area of b^ It is clear 
tiiat directing transfection to particular cell types would be particularly advantag^us in a 
tissue with cellular heterogeneity, such as pancreas, liver, kidney, and the brain. Lipids may 
be chemically coupled to otiier molecules for the purpose of targeting [see Mackey, et aL, 
supra]. Targeted peptides, e.g. , hormones or neuro1ransmitl)»s, and proteins such as 

20 antibodies, or non-peptideinolecules could be coij^led to liposomes chemically. Receptor- 
mediated DNA delivery approaches can also be used [Curiel et al. Hum. Gene Titer. 3:147- 
154 (1992); Wu and Wu, J. Biol Chem. 262:4429-4432 (1987)]. 

Examples of applicaticHis for gene therapy include, but are not limited to, (i) 
expression of enzymes or otiier molecules to correct inherited or acquired metabolic defects; 

25 (ii) expression of moloniles to promote wound healing; (iii) expression of immimomodulatory 
molecules to promote immune-mediated regression or elimination of human cancers; (iv) 
targeted expression of toxic molecules or enzymes enable of activating cytotoxic drugs in 
tumors; (v) targeted expression of anti-viral or anti-microbial agents in pathogen-infected 
cells. Various thmpeutichetmlogous genes can be inserted ma gene therapy vector of the 

30 invention, such as but not limited to adenosine deaminase (ADA) to treat severe combined 
immunodeficiency (SOD); maxkBT genes or lynqdioldne genes into tumor infiltrating (TDL) T 
cells (Kasis et aL, Proa NatL Acad ScL UJIA. 87:473 (1990); Culver et al., ibid 88:3 155 
(1991)]; genes for clotting Actors such as Fact«»- VHI and Factor IX for treating hemophilia 
Pwarld et aLProa NatL Acad ScL USA, 92:1023-1027 (19950); Thompson, Ihromb. and 

35 HaemostatiSt 66:1 19-122 (1991)]; and various other well known therapeutic genes such as. 
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but not limited to, P-globin, d/strophin, insulin, eiytfaropoietin, growQi honnone, 
glucocerebrosidase, p-glucuronidase, a-antitiypsin, phenylalanine faydioxylase, tyrosine 
hydroxylase, (xnilliine transcaibamylase, apolipoproteins, and the like. In general, see U.S. 
Patent No. 5^99,346 to Anderson ei td. 

S Exanqples of applications for genetic vaccination (for protection fixrni pathogens other 

flian HCV) include, but are not Umited to, eaqxression of protective antigens fiom bactaial 
(e.g., uropathogenic K colU Strq^taccod, SUgpMococd, Nisseria\ parasitic 
Plasmodium, Leishmanial Taxoplama\ fungal (eg., Candida, Histoplasma) , and viral (eg., 
HIV, HSV, C!MV, influenza) human patfaogwt. Immunogenicity of protective antigens 

1 0 expressed using HCV-derived RNA expiession vectors can be enhanced using adjuvants, 

including co-esqiiession of immunomodulatory molecules, such as cytokines (e.g., IL-2, GM- 
CSIO to fecihtate development of desired ThlvemisIhZieqio^^ Such adjuvants can be 
dfter incoipcHated and co-expfessedby HCV veclore fhe^ 
combination with tiiese vectors using oflier methods. 

15 

Diagnostic Methods for Infectious HCV 
Diagnostic cell lines. The invention described herein can also be used to derive cell 
lines for sensitive diagnosis of infectious HCV in patient samples. In concept, functional 
HCV components are used to test and create susceptible cell lines (as identified above) in 

20 which easily assayed reporter systems are selectively activated upon HCV infection. 
Examples include, but are not restricted to, (i) defective HCV KNAs lacking replicase 
components that are incorporated as transg^es and whose replication is upregulated or 
induced \xpon HCV infection; and (ii) sensitive heterologous anq>lifiable repealer systems 
activated by HCV infection, hi flie first manifestation, RNA signals required fa: HCV RNA 

25 ampMcation flank a convenimt or a select{d>le marker (see above). E^ession of such 

chimeric RNAs is driven by an appropriate nuclear promoter and elem^its required for proper 
nuclear processing and transport to the cytoplasnt Upon infection of the engineered cell Une 
with HCV, cytoplasmic repUcation and amplification of fiie transgene is induced, triggering 
hig^ levels of rqxxrter expression, as an indicator of productive HCV infection. 

30 hi the second example, cell lines are designed for more tightly regulated but hig^ 

inducible report e r gene amplification and eaipressicm upoa HCV infectioiL Altiiougihttiis 
amplfied system is described in the ccmtext of spedSc components, otiier equivalent 
components can be used In one such systeni, an engineered aIphavirusrq>Ucon transgene is 
created which lacks tiie alphavirus nsP4 polymerase, an enzyme absolutely required for 

3 5 alphavirus RNA amplification and normally produced by cleavage fi'om tiie nonstructural 
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polyprotdn. Additional features of this defective alphaviius rqilicon include a subgeoomic 
RNA promote, driving expression of a lucifen^ This promoter 

element is quiescait in the absence of productive cytoplasmic alphavirus replication. The ceU 
line contains a second transgene for expression of gw fusion consisting of the HCV NS4A 

5 protein and the alphavirus nsP4RDKP. This fused gene is expressed and targeted to the 
cytoplasmic membrane conqKutment, but Ibis form of nsP4 would be inactive as a functional 
component of Hie alphavirus replication complex because a discrete nsP4 protdn, wilb a 
precise N terminus is required for ndP4 activity pLCTmieto^^ An 
(q>tional tiiird transgeoe expresses a defective alphavirus RNA with as signals for rq>lication, 

1 0 transcription of sub^omic KNA mcodir^ a ubiquitin-iisP4 fusion, and an alphavirus 
packaging signal l^n infection of sudi a ceU line by HCV, the HCV NS3 proteinase is 
produced, mediating irons cleavage of flie NS4A-4isP4 fiisim protein, activatiiig die iisP4 
polymerase. This active polymerase, wUchflmctions in Irons and is efiT^ 
amounts, then forms a functional alphavirus replication complex leading to amplification of 

IS the defective alphavirus replicon as well as &e defective alphavirus RNA encoding ubiquitin- 
nsP4. Ubiquitin'-nsP4, expressed ftxmi its siibgenomic KNA, is cleaved efficientty by cellular 
ubiquitin caiboxytominal hydrolase to product additional nsP4, in case ^s enzyme is 
limiting. Once activated, this system would produce extremely high levels of flie reporter 
protein. The time scale of such an HCV infectivity assay is expected to be from ho 

20 sufGcient reporter gene eapresdon). 

Antibody diagnostics. In addition to flie cell lines described here, HCV variant virus 
particles (virions) or components thereof produced by the transfected or infected cell lines, or 
isolated from an inflected animal, may be used as antigens to detect anti-HCV antibodies in 
patient blood or blood products. Because the HCV variant virus particles aie derived fixxm an 

2S auflienticHCV gmome, particular components suchas^e coat proteins are likely to have 
inmiunogenic properties that more closely resemble or are identical to natural H(^ 
if those conq[)onents were produced outside of a replicating HCV. Examples of such 
immunogenic properties include ttie display of wild-type HCV immunogenic qiitopes, and 
moduktion of transcription of genes encodiiigcellidarimmu^ These 

30 reagents can be used to establish ttuit a patient is infected with HCV by detectitig 
seroconversion, Le,, generation of a pcqadation of HCV-specific antibodies. 

Altmatrvely, antibodies genoated to the HCV variant products prepared as 
described berein can be used to detect the presence of HCV in biological samples fiom a 
subject 
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Preferred embodiments of the invention are described in Ibe following example. 
Other embodiments within flie scope of Ae claims herein will be apparent to one skilled in flie 
art from consideration of the specification or practice of the invention as disclosed herein. It 
is intended fliat the specification, togelher wilh the examples, be considered exranplazy only, 
S withthescopeandspiritof the invraiiion being indicated by the claims wMchfoU^ 
exanq>les. 

Example 1 

Ibis exanq>le describes the production and evaluation of replicons comprising a neo 
10 selectable marker and a polyprotein coding region encoding subtype lb nonstructural 
proteins. 

Materials and Methods 

Cdlllnes. The Huh7 ceU lii^ were gen^usly provi ded by Robot Lanf ord 
(Southwest Foundation fiir Biomedical Research, San Antonio, U.S A.) and Ralf 

15 B artenscUager (Johannes Gutenberg Univ^ty Mainz, Mahus, Gennany) and maintained in 
Diilbecco's modified Tntntmal essential media QDMEM; Gibco-BRL) siqyplemoited wifli 10% 
fetal calf serum (FCS), and nonessential amino acids. 

Assembly of a sdectable subtype lb rqilicon. An HCV subtype lb replicon was 
constructed wlucb is similar to the replicon described in Lohmarm et al . , Science 285 : 1 1 0- 1 1 3 

20 (1999). For that construction, a step-wise P(3l-based assay utiUzingKlenTaql^ 

polymerase (Wayne Barnes, Washington University) was developed. cDNAs spanning 600- 
750 bases in lengOi were assembled fiom 10-12 gel-purified oligonucleotides (60-80 
nucleotides in lengdi) wifti unique CQnq>lementary overlq>s of 16 nucleotides. Four or six 
oligonucleotides rqnresenttng the 5' pcvtion of the region to be assembled were annealed and 

25 extended in a standard FCEL llieranaining six oligonucleotides for the synthesis of the 3* 
half of the intended cANAwero mixed in a parallel PCSEl reaction. After 12 cycles of PGR, 
die extended doubl&-strandedDNA products were combined and subjected to an additional 12 
cycles. The pn>ductoffliis reaction resolved as a smear on agarose gels which was excised 
and tbe DNA isolated fcom the agarose. One-fifth of the purified double-stranded DNA 

30 product was amplified by PGR usmg an outer primer pair containing unique restriction 

enzyme sites to fecilitate directional cloning into the pGEM3Zf(+) plasmid vector (Promega). 
PGR products were purified, digested widi appropri ate restriction en2ymes, and ligated into 
similarly cleaved pGEM3Zf(+). Multiple lecombiiiant clones were sequenced and flie correct 
clones identified. Tbe overlapping cDNAfiagments were assembled into the contiguo 

35 rq>licon sequence. In paraOd, a rq[>liconcanyirKgfteleflial mutation in tbeNS5B active site 
(Gly-Asp-Asp [GDD] to Ala^-Ala-Gly [AGG]; pol*) was constructed. 
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KNA transciiptioii and transfection. RNA transcripts were synthesized in a lOOjil 
reaction nuxtore containing 40niM Trisp-HCI (pH 7.9), lOmM NaCl, 12mM MgCl2> 2niM 

spennidine, 3mM each ATP, CTP, GTP and UIP, lOmM ditfaiodireitDl, 100 U KNasin 
(Promega) and 100 U T7 RNA polymerase (Epicentre), and 2]ig Sea /-linearized DNA. The 

S DNA tenq)late was rigorously removed by serial digestions wifh 30 U DNase I (Boehring^). 
Ten (ig of the DNase-digested RNA transcripts were electroporated into 6x 10^ Huh7 cells 
using a model T820squareporator(BTX), and plated on ISOnun dishes. For selection of 
rq)licon-Hxmtaining cells, medium was changol to complete medium containing geneticin 
(G418; ImgAnl; Oibco-BRL) at 24 hr post-transfection and thereafter fiie media was changed 

10 ev«ty 3-4 days. 

RNA analysis. Approximately 5x 10^ cells wim preincubated for 1 h in DMEM 
lacking phosphate supplemented with 5% dialyzed PCS, 1/20^ the normal concentration of 
phosphate and actinomycin D (4^g/ml; Sigma). [^^]orthophosphate (200^Ci/ml; ICN) was 
added and the incubation continued for an additional 12 1l Total cellular KNA was extracted 

15 witti TRIZOL, precipitated, and resuspoided in H2O (Gibco-BRL). Radiolabeled RNA was 

analyzed by denaturing agarose gel electrophoresis and visualized by autoradiogn^hy. 

Frotdn analysis. For iomiunoprecqritation, cell monolayeis were incubated for 

til 

eidier4, 8or 12 bin methionine- and cysteine-deficient MEM containing 1/40 thenonnal 
concentration of metiuonine, 5% dialyzed PCS and Express 3Ss3Ss protein labeling nux 
20 (100)iCi/ml; NEN). Cells were lysed in lOQmM NaP04 pH 7.0 containing 1% sodium 

dodecyl sul&te (SDS) and protease infaibitora, and cellular DNA sheared by repeated passage 
tinough a 27.5 gauge needle. Viral proteins were immunoprecipitated essentially as described 
previously (Grakoui etal^ 1993), using patient serum, JHF, recognizing NS3, NS4B and 
NSSA or rabbit anti-NS5B and Pansoibm cells (Calbiochem). ^bnmunoprecipitates wm 

25 separated on 1 0% SDS-PAGE and visualized by autoradiography. 

Immunostaining. Cells cultured in 8 well chamber slides (Falcon) were fixed in 
acetone for lOmin at 4^ and allowed to air diy. Rehydrated monolayers were incubated at 
37^ .witii an antibody directed against NS3, followed by inciibation wilii a species-specific 
fluorescem-conjugated secondaxy antibody (Pierce), md mounted in 90% glycerol saline 

30 containing SOmMTris-HQ^H 8.8). 

Reverse transcription (KiytCSL KNA was isolated firom cells using TRIZOL 
(Gibco-BRL), prec^itoted and resuqpended in H2O. Levels ofHCV SNA were quantitated 
using conq}etitive RT-PCR assays designed to amplify tte 5 ' and 3' NTR sequences of HCV 
(Kolykhalov et al^ 199Q. For RT-PCR designed to amplify long cDNA fragmaits, about 

35 1000 molecules of HCV RNA was mixed with the HCV-specific primer, and the primer 

extended at 43.5^0 for 1 h using SupCTscript II reverse transcriptase (Gibco-BRL). cDNAs 
were then amplified wifli KlenTaqLA DNA polymerase using 35 cycles of 95^0 for 30 s, 55- 
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60^C for 30 s, and 68°C for 4 min. PC3R. products were recovered from preparative low 
melting-point agarose electrophoresis by phoiol extraction, and '-40ng of purified PGR 
product directly sequenced. 

5 Results 

Establishment of G418-redstaiit colonies. Repliconssinoilar to that described in 
Lobmann et al, svpra^ but derived from tibe H77 infectious clone, failed to confer resistance to 
iG41 8 in five differmt hepatoma cell lines. Sequencesof subtype lb were also used to 
assemble the repUconl377/NS3-3*(EKifBL accession number AI2426S2). RepliconRNAs 

10 were composed of tiie HCV intmial ribosome enfay site (IKES) driving neomycin 

pho^hotiansferase gene (Meo) expression and tiie IRES from encq>haIomyocaniitis virus 
(EMCV), directing translation of HCV iwoteins NS3 to NS5B, followed by the 3' NTR ) 
(Figure 3). Two derivatives woie constructed which either lacked 2 U nucleotides in tiie poly 
(U/UC) tract or carried an ^vall restriction oizyme site in the variable region of the 3' NTR, 

15 dedgnatedHCVreplbBartMan/A2U'sandHCVreplbBarAlan/Av^ Prior to 

transfection, translation and conect polyprotem processing was ccmfirmed for each cDNA. 
sequme using the vacdnia-T? KNA polymerase expression system (data not shown). 

DNase-treated lepliccm RNAs were eledroporated into Huh7 cells and afi^ 2-3 
weeks in culture G4I8-resistant colonies were clearly visible. Both replicondrnvatives were 

20 abletoconferG418redstance,andonaverag^oa]yl in 10^ cells became G418i^ In 
contrast, colonies were never observed for Huh7 cells electroporated in parallel wi& the 
' replicon RNAs containing an inactive NS5B polymerase. 

Verification of autonomous rqillcation. Twenty two independent colonies were 
isolated, 5 colonies corresponded to Huh7 cells transfected with ENA transisibed from 

25 HCVreplbBartMan/A2Us and the remaining 17 colonies were derived firom 

HCVreplbBartMan/Avall KNA. A number of assays were paformed to varity that G41 8 
resistance was mediated by autonomously rqplicating HCV. Anq>lification of sequoioes 
within the 5' and 3' NIIls in a quantitative RT-FCR assay revealed ccqiynumberaw 
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Identification of mutations in HCV replicons. The low fiequmcy of G41 8- 
resistant colonies may be attributed to eiflier a cell &ctor(s) requirement for rq)lication or 
adaptive changes within fte replicon sequence necessary for tibie establishment of HCV 
replication. To address the ktterpossibiUty, the entire replicon sequence was aniplified^^ 

S cDNA reverse transcribed fiomRNA isolated fnom five independent G418-re8i 
clones. Upon direct seq^cingof^ purified PGR population, multiple mutations 
idmtified. The strildxig observation was diat each ceU clone canied a single n 
change within NSSA resulting in a coding change (Figure 7). Iiioneinstance»adeletionof47 
amino acids Q; Figure 7), encompassing tiie intofenm sensitivity determining region (ISDR)» 

10 was found. Sequence analysis of NS5 A from ano&er 8 G41 8-resistant cell clones revealed 
similar point mutations, although 2 clones, ^^ch have low levels of HCV replication and 
slow growfli rates (e.g., clone E in Figure 4), were found to contain wild type NS5A. In 
addition to tiie identified NSS A mutations, nucleotide substitutions were also noted in NS3 
and NS4B; Qone H (SEQ ID N0:9) contains substitutions at nt 3550 (NS3) and nt 4573 

15 (NS4B) (Lys (584) to Glu, and Sa:(925) to Gly of SEQ ID KO:3, embodied in SEQ ID 
NO: 1 7), whoeas nt 2060 (NS3) was mutated in done VI (Figure 7, corresponding to Gin 
(87) to Axg of SEQ ID N0:3, unbodied in SEQ ID NO:15). 

Reconstnictioii off mutant replicons. To determine if the nucleotide changes and 
the deletion identified in NSS A were adaptive, each mutation, except mutation II, was 

20 indepmdentlyeng^eered back into tiieHCVrqjlbBart^^ KNA 

transcribed from each reconstructed r^licon was electroporated into naive Huh7 cells, and 
the number of G418-iesi5tant colonies compared to that obtained for tiie 
HCVreplbBartMan/Avall replicon containing wild type NS5A. Ibe 47 amino acid deletion, 
as well as the point mutations, were cq)able of increasing tiie fiiequency of G418-resistant 

25 colonies to at least 1% of the initial electroporated cell population (Figure 8), indicating these 
mutations targeting NSS A are adaptive allowmg effideait HCV replication in Huh7 cells, la 
addition, G4 1 Snredstant colonies wete observed after transfection of HeLa cells, a Innnan 
epitiielial cell line, witiirqpliconRNA of clone L llierefore, at least one ofthennitations that 
was adaptive in Hiih7 cells also allows tiie establishment of HCV rqilication in a non-hq)atic 

30 cell line. 

Example 2 

This example describes the production of cell lines permissive for HCV replicatioi^ a 
replicon comprising the NS2 coding regioi^ and full-l^gth HCV cDNA clones comprising 
the Ser tone substitution at position 1179 of SEQ ID NO: 3. 
35 Generation of cell lines. As shown in the previous example, G418-resistant cell clones 

harboring persistently rqilicating HCV RNAs were isolated. Two oftheseG418-Tesistant cell 
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clones weie treated extensivety witfi the antiviral, interferan-a, to obtain 2 cell lines void of 

HCVKNA. 11ieseareiefexedtoasinterfm»t-treatedceUl^ 

HCYreplbBartMan/Avall, HCV adaptive rq>licon I or HCV ad£q)tiye leplicon Vn 

were transfected into die interfieninrtreated cell li^ This resulted in a greater G418 

S transduction e£Gdency11iantlmt observed for the pare^ Early 

post-transfection HCV RNA amplification was greatest for the IFN-treated cell line. These 

results indicate ftai die cell Unes^ interferan-treated cell lines I and n, are more pecmissive for 

HCV replication than is tibe parental Huh-7 cell line. 

Such cell tines are not onfy valuable for genetic study of HCV, but also for examining 
10 the cellular environments more p^missive for HCV replication. For example, microarray 

technology will allow us to look globally at diffeences in gene expression profiles between 

the different cell lines. 

Constractioii of rqplicons* Areplicon was constructed wherein the S'NTR of HCV was 
fiised to the IRES of EMCV iqistieam of NS3 , thus oeating a replicon lacking Oie neomycin 

IS phosphotransferase gene. This replicon, smR-EMCV/HCVrepVII (SEQ ID N0:2S), 
rq>Ucates to high levels in Huh? cells, as sliown in Figure 10. Another replicon, 
HCVrep/NS2-SB (SEQ ID NO:22) was made wherein the non-structural protein, NS2, is 
upstream of NS3. As shown in Figure 10, fids rqplicon is also rq>lication-<xnnpetent in Huh7 
cells. Tliis latier replicon can be used advantageously, for example, in testing compounds for 

20 inhibiting HCV replication. The addition of flie NS2 coding region provides an additional 
target for sudi antiviral compounds, as well as providing an additional protein for genetic 
study. 

Fnll-lei^th HCV RNAs. Two fiiU-lengdi HCV cDNA clones were assembled The first, 
HCV FL (SEQ ID NO:24), contains the mutation tiiat encodes a Sct to De substitution in 

25 NSSA,as^wnatpositiQnll79ofSEQIDNO:3(seeFigure9). Ibe second, HCV FL-Neo 
(SEQ ID NO:23), also encodes the Ser to He mutation, and in addition, comprises the 
neonorcin phosphotransfiarase gene immediate^ 3' of fiie 5' NIR and fiie EMCV IRES 
i3Damediately 5* to flie HCV open reading frame (see Figure 9). Bothof tiiesefidl-loigfii 
clones repficate in file interfisfon-treatedceU line I, as s^ Tbisresult 

30 indicates that HCV replication is riot dependent on the EMCV IRES driving the non-structural 
proteins of HCV, because tiie non-structural proteins of fiie HCV FL clone are driven by fiie 
HCV IRES in file full-length clone HCV FL 

In addition, a G41 8 resistant cell line conqmsing the HCV FL-Neo clone has been 
generated fiom the int^eron-treated cell line I described above. This cell line supports high 

35 levels of pmistraifly replicating HCV FL-Meo RNA. 
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All references cited in this specification are hereby incorporated by reference. The 
discussion of fhe referaices herein is intended merely to summarize the assertions made by 
the authors and no admission is made that any reference constitutes prior art. Applicants 
reservG the right to challenge the accuracy and pertinence of the cited references. 

5 In view of the above, it will be seen that &e several advanta^ of Reinvention are 

achieved and other advantages attained. 

As various changes could be made in the above mediods and compositions without 
departii^ fixnn the scope of the invention, it is intended ftat all matt^ contained in the above 
desoription and shown in the acoonqianying drawings and appendix shall be interpreted as 

10 illustrative and not in a limiting sense. 
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Appendix 
SEQIDNOs 

SEQroNO-.l : S'portionofaiiHCVS'NTR 

5 

GGCGACACTC CACCATAGAT C 



SEQIDN0;2 : 3' portion of a 3' NIK. from a ^d-1ype HCV subtype la 

10 

TGGTGGCT(X:ATCTrAGC(XrrAGTCACGGCTAGCrGTGAAAGGTOCGTC 
ATGACTtKAGAGAGKKnXSATACTGGOJK^^ 



IS SBQIDN0;3 ; Andno acid sequence of Hbe polyprotdn regiaa of HCVieplbBarfMan 

MAPlTAYSQQlRGIIjG(mSLTCaa)Kl«}VEGEVQVVSTATQSF^^ 

HGAGSKTI^GPKGPm3MYT^^V^DQDLVGWQAPPGARSLTPC^CGSSDLYLV^ 

VIPVIU»GDSRGSII^PRPVSYIXGSSG(^IIjCPSGHAVGIFRAAVCTRGVAKAV^ 

20 PVESMETTMRSFVFroNSSPPAVPQIRJVAHLHAFro 
VU^PSVAATlXjFGAYMSKAHGIDPNIRTGWTriTC 
Dma)ECHSTDSTllIX3IGTVII>QAETAGAM.VVlJSLTATPPGSVT^ 
GEIPFYGKAIPIEmGGRHIJFCHSKKKCDELAAKI^GLGLN^ 
GDVIWATDALMTGFTGDFDSVnxamiVTQTV^ 

25 RGRTGRGRMGIYRFVTPGERPSGMFDSSVIXIECYDAGCAWYEL 

NmSIPVajDHLEFWESVFTOLTHroAHFI^QTKQAGDMTYLVAYQATVCARAQA 
PPPSWIXJMWKCmiXPTLHGPmXYRWjAVQNEVTI^ 
TSTWVLVGGVIAAI^YCI>TTGSVVIVGRnLSGECPAIIPDREVLYREFDE 
IJ»YIEQa^4QIJ^EQFKQKMGIXQTATKQAEAAAPVVESKWRTIJaA^ 

30 GIQYIJlGi;STLP(MPAIASIJNiAFrASriSPLTIXJHl^^ 

VGAGIAGAAVGJaGIXaCVLVDlLAQYGAGVAGALVAFKVMSCTMPSlBDLV^^ 
n^PGALWGWCAAIIJaaiVGPGEGAVQWMimiAFASRGbmVSPTHYWES© 
ARVTQn^SLTHQIlJKRmQWINEDCXTPCSSKjSW 
UJPRLPGVPFFSOJRGYKGVWRGIXaMQflTCPC^ 

35 WHGTEPINAYTTOPCTPSPAPNYSRALWRVAAEEYVEVTRVGDFH^ 
CPCQVPAPEFFIEVIXjYRLHRYAPACXPIIJIEE 

AVLTSMLTDPSHITAETAKRRIARGSPPSIASSSASQI^APSLKATCriRHD 
EAraXWRQEMGGNnRVESENKVVnJ>£3nBPIX2AEEDEREVSWA^^ 
PIWAIOT3YNPPIJLESWKDPDYWPVVHGC3»lJTAKAPPff^ 
40 ABLATKTFGSSESSAVDSGTATASP1XJPSDIX3DAGSDVESYSSMPPLEGEP(H)PDLSD 
GSWSTVSEEASEDWCCSMSYTWTQALIIPCAAEETKIJmAI^SIIJlHHhlLVYAT 

ARSKFGYGAHDVRNI^SKAVNHIRSVWKDIl^DTETPIiy^^ 
RKPARIJV^DIXiVRVCSEKMALYDWSTLPQAVMGSSYGFQYSPGQRVEFLVNAW^ 

45 AKKCmGFAYiymaiDSTVTEra>mVEeSiyQOCDIJ^FEARQAIRSL^ 
NSKGQNCGYRRC3tASG\a.TTSOG^m,TCYIlCAAAA<3lAAKIX5E^^ 
lOESAGTQEDEASUU^FIEAMlRYSAPPGDPPKPEYDmjniCSSI^ 
VYYLTEa)FITPIJVRAAWETARHTPVNSWIXjMIMYAPTLWAR^«^^ 
QimAIJXXJIYGACYSffiPIJ>IJ>QnQRIfiGLSAFSLHSYSPGEINRVASa^^ 

50 RVWRHRARSVRARI15QGGRAATCGKYIJNWAVRTKOa-TPIPAASQII)I5S^ 
OYSGCH}rraSLSRARFR\mfWCLIll^aVGIYLLI>NR 
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SEQ ID NO:4 : Amino acid sequence of the NS5A protein of HCVr^lbBarfMan 

5 .SGSWIJU)VWDWICriVLTDFKTWLQSmjRLPGWFFSCQRGYKGVWRGTO 
CTCGAQHGHVKNGSMiaVGPRTCSNrWHGTFPmAYTrGPOTSPA^ 
AAEEYVEVTRVGDFHYVTGimDNVKCPCXJWAPEFFIEVlXjVIUJ^ 
REEVTFLVGUfQYLVGSQUCEPEPDVAVLTSMLlDPSHITAErAKK^^ 
SSSASQI^APSUCATCTTKHDSia)ADUBANIXWQEMGGNlTRVK 

10 PIX2AEEDEREVSWAEILia«KKFPRAMPIWAIU»DYNPPIl£SWKDPDY^ 
IPPAKAPPIPPPRWHOVVI^ESWSSAIJVEIATOFGSSESSAVDSOT 
DGDAGSDVESYSSMPPLEGEP(H)PDLSDGSWSTVSEEASEDWC3C 



IS SEQ ID NO:S : Nucleotide sequmce of DMA clone of HCVreplbBarfMan/A2U'8 

GCCAGiXmDGATTOGGGGOGACACTOCAOCATAQATCACK^^ 

TACIOTCTrCACXK^AGAAAGCXriCrAGOCAT^^ 

CXnXXAGGACC(XCOCnXXOGGGAGAGCCATAGTGGTCriX^ 

20 CAOXKJAATTGCCAGGACXJACXXKKnXXnTItrrT^ 

GGAGATITGGGCGTXXXXCCGCXjAGACrGCrAGCXXJAGTAGT^ 
AAGGCXnTGTGGTACKKXnGATAGGGTGCTIXKXSAGTGCCC^^ 
AGACCGTGCAa:ATGAGCACGAATCOV^CCrcAAAGAAAAACCAAAGGGCGC 
GiXATGATTGAACAAGATGGATTGCACGCAGGTIXncaKKXXK^ 

25 GGCTATKXKKnATGACKKKK^ACAACAGACAATOGGCn^^ 
GTTCOGGCIGTCAGCXKAGGGGaKXXXKnTC^^ 
GGTGCanX5AATGAA(nXK:AGGACGAGGCAG0GCGGCTATCGTG^ 
ACGGGa3TrccnTGCXX:AGCK3TGCrcGACX^ 

ggctgctattxkkkxjaagtgcxxkkkk:aggatc^^ 
30 tgoxsagaaagtatcct^tcatggctgatgcaatgcggcggcnracataog^ 
cc)ggcta0ctgcccatrcgacx:accaagcx3aaacatcgc 

ACnXXKSATGGAAGCXXIGTCrriXJIXXJAT^ 

GGGCTCGOGCCAGaX5AACTGTIXX3(X;AGGCrcAAGG(^^ 

GAGGATCnrOGTCmGA(XGATGGa5ATGCXnXKnT^ 

35 ATCKKXGCTirrcrcGATICATCXSAC^^ 

TCAGGACATAGCXmXKKnACXXXntSATATIXKTOAAGAGC^^ 
GCTOACXXOTCXTOGTCKnTTAaKnATC^^ 
CTTCTAT0GCCTIUITGACX3AGTrcTTCTOAGm 
CTCTAGCXKKSATCAATICCXKXXXnUItXOt^^ 

40 AAGOCXKnTGGAATAAGGCXXKntnGCGTrKnCT^^ 
TGCCGTCTITKKSCAATOiXSAGGGCCCGGAAACCK^^ 
CATIXXTAGGGGl<OTimXTCrrCXXX:AAAGGAATGCAAW 
GTGAAGGAAGCAGTTCCTXntjGAAGCnCTTOAAQACAAAC^ 
ACX:CTITGCAGGCAGCGGAACmXX:A(Xrit3GCGACAGGTGC(nUI^^ 

45 AGa:ACX3TGTATAAGATACACXnXK:AAAGGa3GCACAACCCCAGTGCGACGT^ 
TGAGTTGGATAGTTOTGGAAAGAGTCAAATGGCIXJR^^^ 
GGGGCTCAAGQATXKXXJAGAAGGTACOCCATTOTATGGGATCTO 
CXKnGCACATWJmACATGTGTrrAGTCXSAGGTT 
GAA<XACXKK3GAa3TGGTTriXXTITGAAAAACACGATAAT^ 

50 TACGGCCTACim:AACAGACXK:GAGGCCTACTIX3GCKK^ 

ACAGG0CXKK3ACAQGAACX:AGGTCXJAGGGGGAGGTa;AAGlXKn<^^ 

ACACAATCTTKXriXKK^GACXTCCGTCAATXK^^ 

GTG<XX3GCTCAAAGACCOTG<XGGax:AAAGGGCO:7^ 
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CCAATGTGGACCAGGA^CXnXXnaKKnXKKIAAGCXKXXXXX^G 
GACAO^ATGCACCKKXKKJAGCTCXSGACCTITACITGGTCAC^ 
GTCATIXXGGTGOSCCXKKXKKKKXiAC^ 
aXXnxnaJTACTTGAAGGGCTCTiaKKXXKS^ 
5 ACGCTOTGGGCATCmCGGGCTGaXJTGTGCACCCC 

GGACirrOTACCCGTaSAGTCTATGGAAACCACTATGCGGTC 
GACAACrCXmXXCimKSa^GTAOXKIAGACAl^ 

axctacnxkh'agcggcaagagcactaaggtgccggcrgcgtatgcagcccaag 
ggtataaggtgcttgtcxtoaa(xcgtcx:gtcxk^ 
10 gtatatgtctaaggcacatggtatosaccxtaacatcagaacxxkk^ 

CATCACCACXKKnxkxXTCATCACCTACia: 

GGTGGTTGCrcTGGGGGCGCXnATGACATCATAATATGra 

CTGACnx:GACCACTAT(XTGGGCAiaX3CACAGTCCrGGACCAAGC^ 

CTGGAGOKXfACIXXritXjTGCTOGCXIACCGCT^ 
15 G<XACATCX:AAACATOGAGGAGGTGGCnXTOTCCAGCACrG^ 

TATGGCAAAGCCATaXX:ATOGAGACCATCAAGGGGGGGAGGCACXn-CAT^^ 

Ga:ATrCCAAGAAGAAATCTQATGAGCrcGaXKX5AAGC^^ 

CAATGCTGTAGCATATTACXXKXKSCCnTOAT^ 

GACGTCATIGTayrAGCAACGGACGCTCTAATGACGGGC^^ 
20 ACrcAGTGATC!GA(nXK:AATACATCTGrcA(XCAGACAGTra 

C)(XX5ACXTrcACX::ATrGAGACX5ACX5ACX3GTGCX;^^ 

CAGOKKXjAGGCAGGACTCCrrAGGGGCAGGATGGGGATTTACAGGl^^ 

CX:AGGAGAACXKKXXnx:GGGCATGrn«5ATTCC^^ 

ACXKXKSGClXnGCITGGTAOGAGCIX^ 
25 GGCTrACXn7VAACACA(XAGGGTK3CCCGTCTG^ 

GAGAGOncmACAGGCCntACCCACATAGACGCCCATlTC^^ 

AGCAGGCAGGAGACAA(nTCCX:CTA<XTGGTAGCATACCAGGCTACGGTGT^ 

CX:AGGGCTCAGGCKX}A<XTCCATCXnX3 

GCTAAAGCO'ACXSCreCACXKKKXJAACXXXXXnGCTO 
30 CAAAACXSAGGTTACTACCACACAaXXJATAACX^AAATACATCATGGCATGCATGT 

CXKKHXJACXnXKSAGGTCXiTCACXJAGCACX^^ 

CAGCICTOGCCXXXn'ATKKXTOACAACAGGCAGC^^ 

CATCTIXmXXK}AAAG(XGGCX:ATCATIXXa3ACAGGGAAG^ 

TKXJATGAGATGGAAGAGTGOSCCTCACACXnCCCrrrACAT^ 
35 AGCiaKXX3AACAATTCAAACAGAAGGCAATCXKK3TTGCT^^ 

AGCAAGOKSAGGCnXKnGCnmiXjnSGTGGAA^ 

CCTTCnGGGCXSAAGCATATGTCSSAATTK^^ 

CTIGTCCACrCTGCCIGGCAACCmK^ 

TCrATCACX:AGC5CCGCTCACCACXX^ 
40 GATGGGTGGCCXKXCAACrTGCrccrcXXIAGC^^ 

GGCATCXKJraGAGCGGCTGTIGGCAGCATAGGCXm'GGGA^ 

TTTTGGCAGGTrATGGAGCAGGGGTGKK:AGGCX^^ 

GAGCXKKX3AGATGCCCTtX:A0C»AGGACX?rGGTrAAC^ 

TCC»jrGGOGCXXnAGTanXXK3GGlXX3^^ 
45 TGGGax:AGGGGAGGGGGCrGTGCAGTGGATQAACXXjGCTG 

CXKXKKKjTAACX::ACXmTOCXXX;ACXK:ACTATGTG^ 

AanxnCACTCAGATCXnUIXnAGTCTTACX^Al^ 

ACCAGTGGATCLy^OGAGGACTGCnXJACXKXlATGC^ 

ATCTITGGGATTGGATAlGCACGGTGTroACTCATITC 
50 CAAGCTCCroC»GCX3ATroaXKK^ 

AAGGGAGTCnGGCXKKKKXfA(XKK:ATCATGC^^ 

CAGATCACCGGACATGTGAAAAAOKm^CATGAGGATCGTGGGGCCTAGGACC 
TGTAGTAACACGTGGCAlGGAACATTtX<XATrAAaK;GTACACCA(^^ 
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GCAOKXXrrcCXXXKKXKX^AAATTATICTAGGCK^^ 

GGAGTACGTGGAGGTTACGCGGGTGGGGGATTTCCACTACGTGACGGGCATGAC 
CACTOACAACGTAAAGTGCaxnxatAGGTrCCXKKXXXXXSAATTC^ 
GTQGATGGGGTGCXK3TKK:ACAGGTACGCrOCAGC^^ 
5 AGGAGGTCACATIXXTGGTCXK3GC1X:^TCAATA(XTGGT^^ 
ATGCGAGCCCGAACXXjGAaSTAGCAGTGCrcACIT^ 
CACATTACXKKXKSAGACXKSCTAAGCXSTAGGCT^^ 

TGGCCAGcnx:;ATCAGcrrAGa:AGcnxncKKXK:cn^ 

TA(XajTCATGACIXXXXGGACXKnX3ACXJrcATC^ 
10 CAGGAGATGGGCGGGAACATCV^aXXKXjTGGAGTCAGAAAATAAGGTAGTAATT 

TTGGAcrcrncxjAGCCXjcrcc^ 

CXKKXKjAGATCCTGCGGAGGTCCAGGAAATTCXXJrcGAGW 

CA<:XKXXXKjATTACAACCCntX:ACIX3TrAGAGTCCT . 

<XCrcCAGTGGTACACGGGTGTCCATroCCGCXnX3CCAAGGCX^^ 
15 CXntXA<XKjAGGAAGAGGACX3GTKnXXrK^^ 

TGGCXKJAGCKXKXACAAAGACCITaKX^ 

GCXSGCAOKKIAACXKjCXnCTOtnXi^ 

CXXJACXnTGAGTOGTACItOCCATGCmXXXTIGAGC^ 

CGATCrcAGCGACXKK]riCITGGTCTA<XGTAAGC^ 
20 CXnUKK^'GCKXjATGTCCTACACATGGACA^GGC^^ 

GOGGAGGAAACCAAGCKXXXJATCT^TGCACrGAGCAAC^^ 

ACAACTTGGTCTATGCTACAACATCTCXKJAGaK^AAGCC^^ 

TCA(XTrTCACAGACIXK:AGGTCXrKK3AOTAa:A<^^ 

GATGAAGGCGAAGGCGTCCACTVGTTAAGGCrrAAACTTCTATCCGTGGAGG^ 
25 CIXjTAAGCTOAaKXXX:CACATTCX3GCCAGATCrAAATTTG(^ 
GAaJTCaKjAACXTATa:AGCAAGGCa^ 

actixknggaagacactgagacaccaattgacaccaccatcatggct^^ 
aggtittckkx?tccaa(x;agagaaggggggcxxk^ 
attcxx:agatrixk3gggttc!gtgtgtgot 
30 TCCACxxrrcccnx:AGGa:GTGAixKKKnx:Trc 

ACAGCXKK}TCmGTKXJrGQTGAATGCXnX3GAAAGC» 
CnrCXK^ATATGACACXXXKnXSTriTGACTCAACXK^ 
(mGAGGAGTCAATCTA(XAATGTlXnX5A<nTGGCCC^^ 
TAAGGTCGCTCACAGAGOKKnTrACATCXKS^^ 
35 QCAGAACr[Ga3QCTA^XXKXX3(^XKXXKXKXi^QCG 
OjGTAATACOntZACATGTTACITOAAGGCX:^^ 
CKXAGGACTGCACXjATGCTCXnATGa^ 

GOKJGGGGACXXilAAGAGGACGAGGCGAGCCTACGGGlXTTCACGGAG^ 

CTAGATACnUIGaXXXXCIXKKKSACXXXKXXIAAACCAGAATAOG^ 
40 GATAACATCATGCTCXnXX:AATGTGTC\GTC!GC^^ 

GTGTACTATClXI(VCXXX3TGAaXXiA<XACXXXXXnT^^ 

CAGCTAGACACACKXIAGTCAATltXTGGCrAGGCAACATCATCATCT^^ 

CAOCrrGTGGGCAAGGATGATCXTOATGACrcATI^^ 

AGGAACAACTTCAAAAAGCXICTAGATTGTCAGATCTACGGGGCX:^^ 
45 TGAGa^ACTTOACXn'ACCrCAGATCATrCAAraACn^CCATGG^ 

CACTCCATAGTTACKntXAGGrroAGATCAATAGGGTGGC^^ 

ACTTCGGGTA(XGCXXTKKX5AGTCTGGAGACAT(XKK^ 

AGGCTACnmtXX:AGGGGGGGAGGGCKXX:ACITGT^^ 

GGGCAGTAAGGACCAAGCTCAAACrcACIXXAATCa^^ 
50 TTTATCX:AGCIXKjTimiTGCnXKnTACAG^^ 
. TCnXXITGCCC»ACm:»CnX3GTrCATGTGGl^^ 

AGGCATCTATCTACKXXXAACXX^ATGAACXy SGGAGCTAAACACT 

AGGCx:ATanui rmTimjiiTiii rnnurrri 1 1 1 1 iii 1 1 1 1 iii rii rmii 
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TTTTTxnx xTiTmiii ocT LU iTiTi iu:iiiit:i 1 i ccrmxnxxKnaiATcrrTA 

Ga:crAGTCA(:X3GCTAG<nX5TGAAAGGTCXX3TGAGCXXK:^ 
CTCATACTGCKXrrCTCTGCAGATCAAGT 

5 

SEQIDN0:6 : Nucleotide sequoice of DNA clone of HCVreplbBartMan/Avall, where the 
nucleotide change creating Ae Avail site is in lowor case and highU^ted in bold 

CK:CAGK:CCCCX5ATrGCK3GGCGACA<nCCACCATAGATCACritX:CC^^ 
10 TA(nX3TCTTCACXK:AGAAAGCXnCTAGCX:ATGGCGTTAGTATGAGTGT^ 

CCIXXAGGA(X<XXXXnXXXDGGGAGAGCX:ATAGTGGlXn^ 

CAOXKJAATTGCCAGGAaSACXXKJGTCCriTIUITTC 

GGAGATTTGGGCXnXKX:<XCGCX3AGACIXKJrAG<XXj 

AAGGCCTroTGGTACTXKXnXJATAGGGTGCTKK:GAGTG^ 
15 AGACXXmK:^CX:7aX3AGCACX3AATCO'AAAC^^ 

Ga:ATGATroAACAAGATGGATroCACXK:AGGTrciXX:GGO0^^ 

GGCHATTCXKKn'ATGACTGGGCACAACAGAC AAIXXK K^ 

GTICCXKKTCTCAGCXK^GGGGCXKXXXKiT^^ 

GGTGCCCTGAATGAACIXKIAGGACGAGGCAGCXkXKKn'ATC 
20 ACXK3Ga3TTCOTGCXK:AG<nGTGCT(^ 

GGCrcKn'ATIXK3GCGAAGTQCXXKKKK:AGGAl^^ 

tgoxfagaaagtatccatcatggctoalxkiaatgcxks^^ 
<xggctac<jkkxx:attcgacx:accaagcx3AAACatoc^ 
acrcggatggaagcxxknctimcgatcaggatgatctcgac»aagagca^^ 
25 gggcix:xkx3cx:agcc»aackntcgcca^ggc^^ 

GAGGATCTCGTCXnX}AC(XATGG(:X3ATGCXnXKn^ 
ATGGCCGCITITCTGGATrCATCXJACIGTGGCCGGCT 
TCAGGACATAG<XFITGGCrACC«3TGATATTCCTCAAGA^^ 
GCTOACCGCTKXnXXmKnTrACXKnATC^^ 

30 CTTCTATCGCOTCTroACGAGTIXnTCTO 
CKJTAGCXKjGATCAATIXXXKXXXriCTO^^ 
AAGOXKnTGGAATAAGGCCXiGTGTGCXnTKnCT 
TGaXnxnrmKKJAATGTGAGGGCXXGGAAACXJ^^ 
CAlTCCTAGGGGTCnTIXXCCTCICGCC^^ 

35 GTGAAGGAAGCAGTKXTItirroGAAGCnXjrTC 

A(XCTITGCAGQCAGCXK5AACXXXXX:ACXJKK3CXJACAGG 
AGCCACGTGTATAAGATACA(XnGCAAAGG<XKK:ACAAO^^ 
TGAGTTGGATAGlTGTGGAAAGAGTCAAATGGCIXnXXTC 
GGGGCTGAAGGATGCCCAGAAGGTACCXXIATroTATCGGATCT^ 

40 CX3KnXK:ACATGCriTrACATaTGTITAGT^^ 

GAACCAaKKKSACXntKnTTOXnTTCAAAAACAC^ 

TACGGan-ACTCO^AACAGAOGCGAGGOCTACTnKKnXSCA 

ACAGGCCXKK3ACAGGAACCAGGTCXSAGGGGGAGGTa:AAQT^^ 

ACAc/woxnrrccKKKxsACxnxxxm:^ 

45 GTGC03GCrcAAAGACXXrrro(XX3GCXXAAAG^ 

CX:AATGTGGAC3CAGGACCK!GTCGGCTCGCAAGCG(XCCrc 
GACACCATGCACXTGCGGCAGCKXjGAanTrACTI^^ 
GTCATTCCGGTCCXKXGGCGGGGCXjACAGCAG 
CCamnXXTACITGAAGGGCrCTTCXjGGCXK?^^ 

50 ACX]KnXn<KKKL\IXnTICGGGCT^^ 

GGACTITGTACaxntXjAGTCTATGGAAACX^ACTATGa^^ 

GACAACTXIXnxXXXnXXXKKXmACXXK^AGACATTCCAGGT^^ 

(XXXn'ACraGTAGOKKIAAGAGCACTAAGGTGCCGGC^^ 
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GGTATAAGGTGCTIX3TCXnX3AAC<XGTCCGTCG<XGC(^CC^ 
GTATATGTCrrAAGGCACATGGTATCX3ACCCTAACATC^GAA<XGGGGTAAGGAC 
CATCACX:ACGGGTGCXXXX]ATCACQTACrCX:ACCTATGGCAAGT^^ 
GGTGGTIXKn'CIXK3GGGCGan'ATGACATCATAATATGTGATG 
5 CTCACimACCACTATCXnXKKKATCGGCAC^ 
CTGGAGCXKX3ACmnCXjTGClCKKX:ACX»^ 

GCCACATCCAAACATCGAGGAGGTGGCTCroT(XAGCACTGGAGAAAT^ 
TATGGCAAAGCCAIXXXXATCGAGACCATCAAGGGGGGGAGGCACCTCATITI^ 
G(X:A1T(X:AAGAAGAAATGTGATGAGCTCGCCXKX3AAGCTGTC 
10 CAATGCTOTAGCATATTACCXSGGGanTGATGTATCCGTCATAC^^ 

GAOGTCATTOTCGTAGCAACXKJACXKnCTAATGAaKKKriTrAC^^ ' 

ACTCAGTGATCGACKKJAATACATGTGTCACCCAGACAGTTO 

CXJCXSACCITCACXATraAGACXJAaSAC^^ 

CAGCGGCGAGGCAGGACroGTAGGGGCAGGATGGGCATTTACAGGT^ 
15 CCAGGAGAACX3G<XOXXKK3CATGTrCGATrcOX:^ 

AaXXKKXJKnGCTIGGTAOGAGCrCACG<^^ 

GGCTTACXnAAACACACCAGGGTKKXXXntTOC^ 

GAGAGCGTCITrACAGGant:AOCX:ACATAGACXKXX:ATri^^ 

AGCAGGCAGGAGACAACTiaXXTACX:TGGTAGCATACCAG<^ 
20 CX:AGGGCrcAGGCrcx:ACXTO:ATCmt»3GAC^^ 

GCrAAAG(XTACXKrKK:ACGGG(X:AAOG<XCCTXKTrGTAm 

CAAAA<XrAGGTTACrrA<XACACACXXCATAAa:AAATAC^ 
. CXKK:TQACX:rKKiAGGTCGTCAa3AGC^anX 

CAGCKnGGCXDGOGTATKKXJTOACAACAGGCAGCGTGGTCAlT^^ 
25 CATCHTGTCOGGAAAGCXIXKKCAIX:^ 

TICGATGAGATGGAAGAGTGaKXTCACACCrcO^ACATCGAACAG(^ 

AGCrCXKmAACAATrcAAACAGAAGGCAATC^^ 

AGCAAGOKSAGGCKKnXKJrcXXXmKmKJA^ 

CCTIUKKKKXSAAGCATATGTGGAATT^ 
30 CTIXmX^MJICIXKXjrGGCAACCCC^^ 

TCTATCACCAGCXXXKJrCV^CX:ACXX:AACATACXX7I^^ 

GATGGGTGG<XXXXXL\ACTIGCrcXJrCXX^ 

GGCATOGCKK3AGC5GGCrQTIGGCAGCATAGGOCTrG^ 
. TTmKK:AGGTTATQaAGCAGGQGTGGCAGGCX3(XK:m^ 
35 GAGCGQa3AGATGCCCTCCACCX3AGGACX:riXKmAAOCTACr^^ 

TccajixKKXKxxTTAcnxxmxKKKnxxn^ 

TGGGCCCAGGGGAGGGGGCTGTGCAGTGGATOAACCGGCTGATAGCXmx:^^ 

CXK:GGGGTAACX:ACXm:iCaXX:ACGCACTAlXnXKXnX5A<^ 

ACXnGTCAClCAQATOCTCrrcTAGTCTrACX:^^ 
40 ACXAGTGGATCAACGAGGACIXXJnXACGCCATGCIXXGGCrc^^ 

ATGTrnKK5ATroGATATGCA(XK3TOT^ 

CAAGCTCXnXKXXXXJATTCOCJGGQAGTCXXXn^^ 

AAGGGAGTCnXKKXKSGGCGAOGGCATCATGCAAAaiAOCnXKXXAl^^ 

CAGATCACCGGACATGTGAAAAACXKnTCX:ATGAGGATCXn'G(^^ 
45 TGTAGTAACACXmSGCATGGAACATTaXX^ 

GCA(X!C(XriXXXX3GGCGCOUVATrATTCT^ 

GGAGTACGTGGAGGTrACX3CGGGTGGGGGATITCX:ACTACXJTGA<XGGCATC 
CACTOACAACXn'AAAGTGax:GTGTCAGGTItXGGCCCXXX3AAT^^ 
GTGGATGGGGTGOGGTIXK:AC:AGGTACXKritX:AGC^^ 
50 AGQAGGTCACATTCOXKjKXKKKnCAATCAATAa^^ 

ATGCGAGC«:GAACCGGA(X}TAGCAGTGCTCACTrcCATG<:nx:AC^ 

CACATTACGGOKjAGACXXKnAAGaSTAGGCTGGa^ 

TGGCCAGCTCATCAGCTAGCCAGCTGTCriXKXKXJ^^ 
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TA(XXXrrcy^TGACTax:CGGACGCrGA<XTCATC^ 

CAGGAGATGGGCGGGAACATCAOXIGCGTGGAGTCAGAAAATAAGGTAGTAATT 
TKK}ACTCrrTlXX3AG<XGCl<X:A^ 

CGGCGGAGAT(XTGCGGAGGT(XAGGAAAlTaxnx:GAGCGATGC(XATATGG^ 
5 CACGCCaSGATTACAACCXTOCACIXm-AGAGTCUIXKjAAQQ^ 

CCXJIXXAGTGGTACACGGGTGTCCATKKXXKXnXKX^AAG^^ 

CXJnXACGGAGGAAGAGGAOJGTIXnXXTCTCAGj^ 

TGGCXKjAGCItXKXlACAAAGAajriXX3(^ 

GCGGCACX3GCAACGG0CTCrrCCrGAa:AGC0^^ 
10 CXXJACXSTTGAGTCXnACrCCTCXIATGCXOC^^ 

CGAlCTCAGCGACGGGTCnTXKnCTAaXJTAAGaj 

ancroCIXKTCGAIOTCCTAC^^ 

GCXKjAGGAAAOilAAGCTGCXXATCAATGCACro 

ACAACnTGGTCTATGCTACAACATCTCXSCAGC^ 
15 TCACOTTOACAGACroCAGGTCXJroGACXJAO^ 

GATGAAGGCXjAAGGOGTCCACAGTrAAGGCTAAACTICTATC^ 

Cim"AAGCTCACX3CCCCCACATnXKKX:AGATCT 

GA0GTaXK5AACXnATa:AGCAAGGCX:GTrAAC^ 

ACITGCKKJAAGACACTOAGACACCAATIGACACCACXATCAT^ 
20 AGGTITIUIGCXjTCCAA(XAGAGAAGGGGGGCXXX:AAGCCAGCT^ 

ATIXXCAGATTIXKKjGGITCXSTCTGTGCG 

TXXACXXnmntyV^GGCXXmSATGGGCIUI^^ 

ACAGCGGGTajAGnrrcCTGGTGAATGCXTOGAAAGCGAAGAAAT^ 

CITCGCATATGAC^CXXXXnxmTroACrcAACGGTCACTOAGAATGAC^ 
25 GTTGAGGAGTCAATCrACX:AATGTTGTGACTIXK^ 

TAAGQTCXKJTCACAQAGCXKKJlTrACATCXKK^ 

GCAGAACTGCGGCTATCG(XXK3TGCCGaKX}AGCGGT^ 

CXKJTAATAaXrrcACATGTTACnTGAAGGiXGC^^ 

CTCXIAGQACTGCAOGATGCICGTATGOGGAGACXSACCTroTC^ 
30 GCXKXKKK3ACXXL\AGAGGACX}AGGOGAG<XTACXKKKXnTCV^ 

CTAGATACrCKKXXXXXXnXKKK3A(XCXKXX;AAArc 

GATAACATCATXKnCX^TOCAATGTGTCAGTCXKXjCACGATG^ 

GTGTACTATCrrcA(XXXm}AaXX:ACX:ACXXCCC^^ 

CAGCTAGACACACTOIAGTCAATianXKKTAGGCAACATCA^ 
35 CACXnTGTGGGCAAGGATGATCOGATGACTCATr^^ 

AGGAACAACTTOAAAAAGCXXJrAGATrQTCAGATCrrAOGG 

TGAGOiACTIGACCTACXrrcAGATCATr^^ 

CACT(XATAGTrACTCrnXAGGTCAGATCAATAGGGTGGC^ 

ACTIt3GGGTACCGCXXTIGCX3AGTCrKK5AGACATC^ 
40 AGGCTACTOTC<XAGGGGGGQAGGGCroa:ACI^^ 

GGGCAGTAAGGAOCAAGCnCAAACTCACIXXAATCCXXK^^ 

TITAT(XAGCrKK3TroGTTGCKKnTACAGCGGG^ 

TCnCGTGCCXXjAC(XOKnXjGTrCATGTGGT^^ 

AGGCATCTATCTACnXXXXIAA OCXjATG AACXSGGGAcCT^ 
45 AGGCCATXXTGTTTTITIXXXnTTm 

TmrnxnxxnriTrTTnxxTC^^ 

TAGaXn-AGTCAOGGCTAGCnXSTGAAAGGTCOGTGAGCXXKnTGAC^^ 
TGCTCATACKKKXnurcrrGCAGATCAAGT 



50 

SEQIDNO;? : Nucleotide sequence of DNA clone of HCV adaptive leplicon I, wfaere liie 
amino acid gmetated by flie deletion is identified in lower case and hi^ilighted in bold 
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GCCA^GCXXXXGATKKKKXKX}ACACnX^a^ 
TACreTCTrcAOJCAGAAAGCGTCTAGCCATGGam'AGTATC^ 
(XTCCAGGAOXmXTCCXXjGGAGAGCCATAQTGGTCTGC^ 
CACX^GAATIXKXAGGACGACCXSGGTCCTrrCTK^ 
5 GGAGATITGGGCXjTGCC(XCGCGAGACrGCrAGCXXjAGTAGTGTTGGGTC^ 
AAGGCCTTOTGGTACTGCCTOATAGGGTGCTTGCGAGTGCCaXKKJA 
AGACXX3TQCA(XATGAGCACXjAATCOAAA0CTCA/^GAAAAACX:AA^ 
GOCATGATTGAACAAGATGGATTCCAOK^AGGTIUrCCGGCCGCTrGGQTGGAGA 
GGCTATTa3KKrrATGACTGGGCACAACAGACAATCX3G^ 

10 GTTCCXKKnxnCAGCXKIAGGGGCGCCXXKnTC^^ 

GGTGCCCTGAATGAACTGCAGGAOSAGGCAGCXjCGGCrATC^ 
ACGGGCGTTCCTTGCXKJAGCIGTGCKXJACGT^^ 
GGCTGCTATTGGGCGAAGTGOXKKKK^AGGATCTa^ 
TGCCGAGAAAGTATCX:ATCATGGCK}ATGCAATGCXKKXKKrrG^^ 

15 CXXKKnACCIXKXXLVTrcGACX:ACCAAGC»AAAC^ 

A<nx:XK3ATGGAAGa:GGTCTTGTCGATCAGGATGATCTGGAa5AAG^ 
GGGCTCGCXSCCAGCCGAACTCTKXKXlAGGCrcAAGGC^ 
GAGGATCrimiajraACXX:AT0GC»ATGCOXKmXKXX^ 
ATGGCCGCmrCTGGATTCATCGACIXnXKSCCXSG^ 

20 TCAGGACATAGCXnTGGCTACCCGTGATATTCCTGAAGAGCTKK3^^ 
GCTGACCGCTnxria3TGCTITACXK3TAT^^ 
CTTCTATCGCCTIUITGACGAGTIXjrKnXSA 
CrcTAGOGGGATCAATT<XGOO(XTX7IXXOtXXXXXX^^ 
AAGOCXKnTGGAATAAGGOXKJTffreCGTITGTC^ 

25 tgccgtctttixkk:aatgtgaggg<xx:ggaaacctgg<xct^^ 

CATTCO"AGGGGTCriTrc(XXJIUreGCX:A^ 

GTGAAGGAAGCAGTTCXnxnGGAAGCITCTrcAAGACAAACAAa^^ 

AC(XriTGCAGGCAGCX3GA^(XCCCCA<Xn<3Ga5ACAGGl^^ 

AGCCACXSTGTATAAGATACACXntKlAAAGGCXS^^ 

30 TGAGTTGGATAGlTGTGGAAAGAGTCAAATGGCIUKXnCAAGCGTATT^^ 
GGGGCTCAAGGATGOGCAGAAGGTACXXXIATTCTATGGGATCT 
CXKmKIACATGCmACATQTGTTrAGTCXJAGGrrAAAAAAC^^ 
GAA(X:ACGGGGACGTGGTriTCXrrnt5AAAAACACX3ATAATAa:ATGGC^^ 
TAaKKXnACIXXCAACAGA<XKX3AGGCXrrACTIGGCTGC^ 

35 ACAGGCX^GGGACAGGAACCAGGTCGAGGGGOAGaTCX^AAGTGGIC^ 
ACACAATCrmxnGGCGACXnGCXnCAATGGCGTGTGTrGGAC^^ 
GTGCCXKKnX^AAAGACXCTTGCCGGCXXAAAGGGCO:^ 
<XAATGTGGA(XAGGA(XriXX3T(XK3CTGGCAAGC^^ 
GACACCATGCACOXKXKX:AGCT0GGA(XTITACnrrGGTCATO 

40 GTCATIXXGGTGCXSCOGGCGGGGCXSACAGCAGGGGGAGCXjrAC^^ 
CXXGTCTCCTACTroAAGGGCTtnTCXK^ 
AOiCTOIXKKKJATCTITCXKKKn^ 
GGACTTTGTACCmrCGAGTCrATGGAAACXiV^CT^^^ 
GACAACnmiXXXXJKXJGGOCGTACXXSCAGACATTCCAGGTGGCCCAl^ 

45 COXTACTGGTAGCGGCAAGAGCACTAAGGTGCXXKKnGCG^ 
GGTATAAGGTGCnGTCXnGAAC00GTCXXnX3GCm^ 

GTATATGTCTAAGGCACATGGTATCGACCCTAACATCAGAAOCGGGGTAAGGAC 
CATCAa:ACGGGTGCOXCATCA.a3TACnxXACXn-AT^ 
GGTGGTTGCICTOGGGGCGCCTATCACATCATA^ 
50 CTGACKXiACCACrATCCKKKK::^^ 
Cl<K3AGaKX3ACnXmX3TGCItXKX^ 

GCX^ACATCCAAACATaSAGGAGQTGGCIUroTa^AGCACKKSAG 
TATQGCAAAGCXIATCCaiATCXxAGACXIATCAAGGGGGGGAGGCACCT^^ 
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GCCATT(X:AAGAAGAAATGTGATGAGCnXKX:GCGAAG<nX3TCCXKK^ 

CAATGCTCTAGCATATTAOXKJGGCXriTGATCTATCO^ 

GACmx:ATTGTCOTAGCAACGQACGCrcTAATGAC^^ 

ACTCAGTGATCGACTGCAATACATGTGTCACCCAGACAGTCGACnTCAGCCrGGA 

5 CCCGACXnTCACCATTGAGA(:X3ACGACX:X3TG(XACAAQACGCGGTGTCA(^^ 
CAGCGGOjAGGCAGQACKKn'AGGGGC^QGATGGGCAm^ 
CCAGGAGAACGGCCCIXXK3GCATGTimATrCCKXKm-C^^ 
A(XKXK3GCnX5TGCTIXK5TAC»AGCrcACGCC(^^ 
GGCrrACO"AAACACA<XAGGGTTXKXXX3TCTGa:^^ 

10 GAGAGCGTCTITACAGGCOCAaX^ACATAGACGOCCATn^^ 

AGCAGGCAGGAGACAACrrcCCXTACCTGGTAGCATACXAGGCrACGGT^ 
C5CAGGGCrcAGGCrcCA(XTCCAT03TGGGACCAAATGTGG>^ 
GCTAAAGCCTACXKnXjCACCGGCCAACGCXXXnXKn'GTATAGGC^^ 
CAAAACGAGGTTACTACXACACACOXATAACCAAATACATCATGGCATGCATGT 

15 a3GCrGACCTGGAGGTCXnCAC«AGCA<XTGGGTGCnX3^^ 
CAGCTCKKjCOKXiTATKKXJIXU 
CATCTKnmKJAAAGCCXKKXJATCATTC^^ 
TTOjATGAGATGGAAGAGTGCXjCOXZACACXntXXnTACAT^ 
AGCOtXKXXfAACAATTCAAACAGAAGGCAATCXKj^ 

20 AGCAAGCGGAGGCroCKKnOXGTXKnGQAATCX:^ 
CCTTCTGGGCXjAAGCATATGTGGAATTTCATCAC^^ 
CTTQTCXZACIUIXKXnGGCAACaX^ 

TCTATCACx::AGCXX5QcrcAcx:ACXx::^^ 

GATGGGTGGCa3CCX:U^CriTCCIXX^ 

25 GGCATCGCKKJAGCXKKnxmXKK^AGCATAGGO^^ 

TirKKjCAGGTTATGGAGCAGGGGTGGCAGGCGCGCrcGTGGCCTTrAAGGTCAT 
GAGCGGCGAGATGCXXTCXIACCXjAGGACXnXKm'AAaTrACTCCC^^ 
TCCCXTGGCXjCCCnAGTCGTCGGGGTajTGTGCGCA^ 
TGGGCCCAGGGGAGGGGGCTGTtK^AGTGGATGAAOXKKn'GATAGa^^ 

30 aKXKK3GTAA(XACGlXnXXXXX:ACXK::ACTATCTGC^^ 
ACmmt^CnX^AGATCCICrcTAGTCnTACX^^ 
ACCAGTGGATCAACGAGGACIXKnXX::ACXKX:ATGCTCX^^ 
ATGTITGGGATTXK3ATATGCACGGTGTK3ACK3AT^^ 
CAAGCIXXTGOCGCXJATnKXXKKSAGlCCXXrrr^^ 

35 AAGGGAGTCrGGaKKKKXjAaKKlAL^^ 

CAGATCACCGGACATGTCAAAAACJGCnTaLVTGAGGATCOT 
TGTAGTAACACXnXKK:ATGGAACATTCOXATrAA(^^ 
GCAOjCCXiriXXCCXKKXjCX^AAATTAT^^ 
GGAGTACXn<K}AGGTTACGCXKK3TGGGGGATITCCACTATO 

40 CACTGACAAOjTAAAOTGCXXXjimXZAGGTn:^^ 

GTGGATCGGGTGCGGlTGCACAGGTACGCrOCAGCm^ 
AGGAGGTCACATTCXJKKnXXKKKJTCAATC^^ 
ATGa3AGaXGAACXXKJACX3TAGCAGTGCnCACTT€CAT^^ 
CACATTACX3GCGGAGACGGCrAAGCX3TAGGCnKKX:AGGGGATCriXX(^^ 

45 TGGCCAGCrCATCAGCTAGCX^GCrotiicTCTrrOT 

GATGAGAGGGAAGTATCmiTCCGGCXK5AGATCXnXKX3GAGGT^ 
CCTCGAGCGATGCOCATATGGGCACGCCCGGATTACAACXXnXX 
CCKK}AAGQACXXX!QACTACXnmTO:)AGTOGT^ 
TG<XAAGGCCa:rCXDGATACX:ACCTCX:AC^ 

50 GAATCTACmrGTUITCrKKXnTGGCGGAGCrcGCC^ 

CCGAATCX3TCGGCXXnCX}ACAGaKK:VV(XKX:AACX3GC^^ 

CWACGACXKKXj^AOKXKKSATCXXiACGTTCAGTCGTAC^^ 

GAGGGGGAGCXX3GGGGATCXXX3ATCTCAGC»ACXKKjTCTI^^ 
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GAGGAGGCTAQTGAGGACGTCGTCrKMntKnXXSATGTCCTAC^ 

gccctgatcacgccatgcgckk:ggaggaaaccaagc^^ 

AGCAACTCriTKKnm3TCAa::ACAACTK^ 
CAAGCX]T<XXKK:AGAAGAAGGTCACXnTroACAGAClX^ 

5 ACrA<XGGGA<XnrGCrcAAGGAGATGAAGG(XfAAGGCGTa:A<:^ 
AACnCTATCCXmKJAGGAAGCCTOTAAGC^ 
TAAATITGGCTATGGGGCAAAGGACXjTCXXSGAAOCTATCC 
CCACATCCX3CTCXXnGTGGAAGGACTIXK7KK5AAGACACTG 
ACXDACX^ATCATGGCAAAAAATGAQGTITrcroaS^ 

10 CXK^AAGCCAGCTCGCXJrrATCmATTCCXIAGATTI^^ 

AAATGGCanTTACGATGTGGTUIXXACXXnmnCAGGCay^ 
TACGGAITCX:AATACIUKXnX3OACAGOGG0TCX?AGTI€CT 
AAGCGAAGAAATGOXTATGGGCmXKATATGACACXCGCTGT^ 
GGTCACrcAGAATGACATCCGTGTTGAGGAGTCAATCTA(X:A^ 

15 GCC<XCGAAG<XAGACAGGCCATAAGGTX3GCrcACAGAGC^^ 

GGCCmiTCACTAATIUrAAAGGGCAGAACnGCGGCTATCXKXXK^^ 
GCXKnGTACIGAC«Aa:AGClXKXK?rAATACXX^ 
TG<XK3COGT0GAGCroCGAAGCntX:AGGACI^^ 
GACXiriTGlXXmATCIGTCAAAGCXSCGGGGACX^ 

20 aKKKX:iTCACXXiAGGCTATaACrAQATACrciXK^^ 
AA0CAGAATACGACITCGAQTK5ATAACATCATG^ 
GCAOjATGCATCrroGCAAAAGGGTGTACTATCTCACCC^ 
CTIGCGCGGGCrGCGTGGGAGACAGCTAGACACACi^^ 
GCAACATCATCATGTATGCGOXACXTKJ^^ 

25 TTTCTTCKXATCXnTCTAGCTCAG^ 

TCTACGGGGCCTCTTACTOCATTGAGCX^ACITGAanACXri^ 
ACnXX:ATGGC<nTAGCGCATriTCACrcCATAGTrACTC^ 
GGGTGGCITCATGCXjrcAGGAAACITGGGGTACXXKX^ 
TCGGGCXJAGAAGTGTCCGCGCTAGGCrACIGTCCX:^ 

30 TTGTGGCAAGTACXnrCTTCAACnXKKKIAGT^ 

ATCCXXSGCKKXSTOCCAGTIGGATrTATOCAGC^^ 
GGGAGACATATATCACAGCXJrGTCTOGTGCmSACmXKnGGT^ 
CTACIOITACTITCnXJrAGGGGTAGGCATCTATCT^ 
GGACCTAAACACTtXAGGCCAATAGGCCATOCraTI^^ 

35 iiTiTiTrnTmiTiTiTiTiTiTrmrric^ 

TTCTTKXnTKKmXKnXXATCTTAGC^^ 
GTGAGCCGCITQACTCCAGAGAGTGCIGATACIXKKX:^^ 



40 SEQIDN0:8 : Nucleotide sequence of DNA clone of HCV adaptive replicoa VI, where 
nucleotide changes are in lower case and highlighted iiirbold 

G0CAGCXXXXX3ATrGGGGGC!GACACTCCA(X:ATAGATCAC^^ 
TACTOTCnrCACGCAGAAAGCXnCTAGCCATGGCGTTAGTATGAC^^ 
45 CCItXAGGACJCXXXXXJrCXXXKKSAGAGCX:::^ 

CAOXKSAATItKXAGGACGACXXKSGTtXTITCT^^ 

GGAGATITGGGOGTGOCXXXX3CX3AGACnGCTAGCXX}AGTAGTGTTGGGT^^ 
AAGGCXTKnXKJTACnXKXnGATAGGGTGCTIXKJGAGTGC^ 
AGACCGTGCACXATGAGCACXJAATCXjrAAACCTCAAAGAAAAACCi^ 
50 GCCATGATTGAACAAGATGGATIGCACGCAGGTTCTCCXKKXXK^ 
GGCTATTCXKKn'ATGACIGGGCACAACV^GACAATaj^^ 
GTTXXGGCIX3TCAGCXK}AGGGGCGaXX3GTTCTIT^ 
GGTGaxriGAATGAACTCCAGGA<XfAGGCAGCGCX3GCrATCX5T^^ 
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ACXKXKXSTKXTTGaKlAGCTGTGCTOiACGTTG^ 

CKKJIXKn'ATKKK3CGAAGTGCXXKKKK:AGGATCTCCT 

TGiXOAGAAAGTAlXXATCATGGCraATGCAATGCGGCGGC^^ 

aXKKn'A(XrKKXX:ATTCmCX:^CX:AAGCGAAACATC^ 
5 ACTCGGATGGAAGCX^GGTCTroTCGATCAGGATGATCrreGACGAAGAGCATCAG 

GGGCTCXMXKXLVGCCX3AACimTCGa:AGGC^ 

GAGGATCTCGTOGTGACOCATGGCXJATGCCIGCTI^^ 

ATCKKXXXnTirCKKjATTCATCGACTCTGGCCGa 

TCAGGACATAGanTGGCTAC(XGTOATATIXK:TOAAGAGCIT^ 
10 GCTOAOOGCTKXTCXnGCITrACGGTAT^^ 

CTICTATaKXrrcriTGACGAGTTCTIU^ 

CrcTAGOGGGATCAATTOXKXOTrCTtXX^ 

aagcokjitggaataaggaxkstgtgcgtttgtc^^^ 
txkx:gtctitkkk:aatgtgaggg<xxxk5aaacctcg<xc^^ 

15 CATTCCTAGGGGTCTITCXXXnxnXXKX:AAAGG^ 

GTGAAGGAAGCAGTItXJKnXjGAAGCTIXnTCAAGA^ 
AaxnTKK::AGGCAGaK5AACOXCCA(Xr^^ 
AGCCAOTIOTATAAGATACACCnK^AAAGGCGGC 
TGAGTrGGATAGTrGTGGAAAGAGTCAAAlXKMnur^^ 

20 GGGGCrGAAGGATGCXXAGAAGGTAOXXJATTXnATGGG 
CXKnGCACATGCrTTAC^TGTGTITAGTCXiAGG^^ 
GAACCAOKKKJACGTGGTriTCCITroAAAAACACGATAATACCAT^^ 
TACGGOTACrcCCAACAGACXKX3AGGCCrACTIX3GCrGC^^ 
ACAGGCXXKK3AaVGGAA(X::AGGTC»AGGGGGAGGTCX:::AAGT^^ 

25 ACACAATCriTrCCroGCGA(XrKK:GTCAATGGa31^^ 

GlX3CCGGCTCAAAGAC(XnTG<XXK3arAAAGGGCCX::AATCAC^ 
a:AATGTGGACCAGGA(XrrCXm:GGCTGGCgAGCGCCCC(XGGGG(^^ 
GACACCATGCACXnGCXKSCAGCKXKSACCITrACTO 
GTCL\TraXK3TGa3CCGGCGGGGC»ACAGCAG^ 

30 CCCXJnrCIXXrrACITGAAGGGCKnTC^^ 
ACX3CTGTQGQCATCTrrcGGGCIGC^^ 
GGACTTTCTAOCOGTCJGAGTCTATGGAAAaiACTAT^^ 
GACAACTOnXXXXrnXGGCXIXStACC^ 
CXXXJTACTGGTAGOGGCAAGAGCACTAAGGTGCXXiGCKj^ 

35 GGTATAAGGKK7ITGTOCrGAAaXX3TCXX?I^^ 

GTATATGTCTAAGGCAC7^TGGTATC»A<XCTAACATCAGA^ 
CATCAO^ACGGGTCCmXATCACGTACIXXACCrAT^ 
GGTGGTKKnXJKKKKKKXKXTATGACATCATAATATGTGATGAGT^^ 
CTGACmXSAOIlACTATCXJIXKKKZATCX^^ 

40 CroGAGCX3C»ACIXX!TCGTGCKXK3^ 

GCCACATCCAAACATCGAGGAGGTGGCIXnX3TCX:AGCACKK5 
TATGGCAAAGCX:ATaXX:ATCXjAGA0CATC7U^ 
GCCATTCCAAGAAGAAATGTGATGAGCIXXKXXKXjAAGCraT^^ 
CAATGCIGTAGCATATTACXXKKJGCCrTOATGTATCCmr 

45 GACXnCATTGTCOTAGCAAOKJACXKJICr^^ 
ACTCAGTGATCGACKKilAATACATQTGTCACCC^ 
COXtACCTKAO:ATTGAGAC»ACGACCXnXK^ 

CAGCGGCX5AGGCAGGA<nGGTAGGGGCAGGATGGGCATTrACAGGTriX^ 
(XAGGAGAACGGCCXn'CGGGCATGTICGATTXXnCGGTrCIGT^^ 
50 ACXKXKKKn'GTXKnTGGTACGAGCTCAOMXXrGC^ 

GGCTTACXrrAAACACACXAGGGTTGCXXXSTCrGCCAGGAC^ 
GAGAGCGTCrTTACAGGanX^ACXXACATAGACXKXXAl^^ 
AGCAGGCAGGAGACAACITCXXXJrACXn'GGTAGCATACX^ 
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CCAGGGCrcAGGCnXXACX:T(X:ATCXnXK3GA(XAAATGT(^ 
GCTAAAGCCTAaXJTGCACGGGCCAACGCaxn'GCTGTATAGGCTGGGAGC^ 
CAAAACGAGGTTACTACCACACACCCXATAACXIAAATACATCATGGCATGCATQT 
OKKJrGACCTGGAGGTCGTCACGAGCACCnXKKjTGCrGGTAGGCGG^ 
5 CAGCTCnXSGCCXKXJTATnKXTOACAACAGGCAGOGTGGTCATroTGGG^ 
CATCTIXjTOCGGAAAG(XGGCX::ATCATim:GACAGGGAAG^ 
TltXfATGAGATGGAAGAGTGCXKXnCACACCimnTACA^ 
AGCTCGCCGAACAATrcAAACAGAAGGCAATCXKK3TTGCIX3CAAACAG<X 
AGCAAGCGGAGGCTCCKKnXXXGTGGTGQAATCCAAGTCGCGGAC^^ 

10 CCTIUIGGGGGAAGCATATGTGGAATnCATCAGCGGGATACAATATn^^ 
CTTOTCCACrCTGCCrGGCAACCaXXXSATAGCA^^ 
TCTATCA(XAG<XCXKJrcA<XACXX:AACATACXXT^^ 
GATGGGTGG0CGC0CAACTIGCKXntXXAGCX3Cn^ 
GGCATCX3CIXK}AGCX3GCK}TTGGCAGCATAGGCCITGGGAAGGT^^ 

15 TrTTGGCAGGTTATGGAGCAGGGGTGGCAGGOKXKnCGTC^ 
GAGCX3GCX3AGATG(XCKX:ACXX}AGGAaTOGTrAACCT 
TOCCCTGGCGCCCTAGICGTCQGG(mxnGTGCGCA(^ 
TGGGC(X:AGGGGAGGGGGCTGTGCAGTGGATGAA<XGGCK5ATAGamx:XK^ 
CXKXKKKSTAACCACGTCriXXCO^AaK^ACT^^ 

20 ACxmncACTCAGATCXjrcrrcrAGTCT^^ 

ACX^AGTGGATCAAOGAGGACrGCrcCAOGCX^ATGC^^ 

ATGlTnKK}ATKK}ATATGCACXK]nxmXjA<nXj 

CAAGCKXTOCXXKXiATKKXX3GGAGT0(XCriTCTK^ 

AAGGGAGTCroGaKK3GCXjACGGCATCATGCAAAa:ACX;^^ 
25 CAGATCAaX3GACATGTGAAAAAaKnTCX:;ATGAGGATanX^^ 

TCTAGTAACACGTGGCATQQAACATimXATTAACGCGTACA(X:ACGGG<^^ 

GCACXKXXJIXXOCXSQCXKXAAATrATrCTAQGGCJGCl^^ 

GGAGTACXJTGGAGGTrACXKXK3GTGGGGGATrra:ACTACXnX3ACGGGCATGAC 

CACTGACAAOSTAAAGTGCCXXmnxIlAGGrrCXXKKXX^ 
30 GTGGATGGGGTGCGGTTGCACAGGTAOKTOCAGCXjTGCAAACC^^ 

AGGAGGTCACATTCCKKiTOGGGCTCAATCAATAC^^ 

ATGCX5AG<XX3GAAaXMJACX?rAGCAGTGCTCACTK^ 

CACATTACXKKXKSAGACXKKnAAGCGTAGGCTGGCCAGGGGAl^^ 

TGGCCAGCrcATCAGCrAtCCAGCIGTCrKja^^ 
35 ACCCGTKATXiACrCCa^QGACOCIQACC^^ 

AGGAGATGGGCXKK3AACATCA<XCGCXnX3GAQTCAGAAAATAAGGTAGTAATn' 

TGGACICITICGAGCXXXntX^AAGCXKJAGGAGGATGAGAGGGAAGTATC^^ 

aKKXK>AGATXXrKXXK5AGGTCCAGQAAATICCX:rrOT 

CACG<XCXK}ATTACAACCClXX:ACIXm'AGAGTCCK3GAAGGAC<^^ 
40 (XXnXDCAGTGGTACAaKSGTGTCCATKKXGC^^ 

CXJTCCAOGGAGGAAGAGGACXKnTGTCCTGTCAGAAT^ 

TGGCGGAGCTCGCX:ACAAAGACXnTCXKK:AGCIXX» 

GCGGCAaKK::;AA<XKKxricixxn«Acx:AGa^ 

CCGACGTn3AGTCGTACTCCItX:ATG<XCC(XXJITGAGGGGGAG^^ 
45 a5ATCrrcAG<XfAC5GGGTCriGGTCTAaxn7VAG<^ 

CQTCKKnGCnCGATGTOCTACACATGGACAGGCX^^ 

GCGGAGGAAACX^AAGCIXKXXJATCAATGCACTOAGCAAC^^ 

ACAACITGGTCTATGCTACAACATCnXXKZAGC^^ 

TC^CXnTTOACAGACIXKIAGGTCXnGGAOGACCACTACCGW 
50 GATGAAGGCX3AAGGCXnCCACAGTrAAGGCrAAACrTCTATa:GTGGAGGAA^ 

(nxyrAAGCTOACXKXXXXlACATnXSGCCAGATCT 
. GACGTCXXjGAAan-ATCCAGCAAGGCCGTTAACCACATCCGCTCXGTGTGGA^ 

ACriGCIXK3AAGAC:ACnX3AGACACX:AATTGA<^^ 
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AGGTITICTGCGTCCAACXAGAGAAGGGGGGCCGCAAGCCAG^^ 

ATKXCAGATTTGGGGGTTCGTGTGTGCXSAGAAAATGGCCC^^ 

T(XACCCT(X(n^CAGGCXXnXjATGGGCTXn^ 

ACAGCGGGT(^AGTTajIX3GTCAATG<XTGGAAAGCGAAGAAAT^ 
5 CirCXKJATATGACAaXXKnxmrraACTCAAC^^ 

GTTGAGGAGTCAATCTAOJAATGTlXnGACnTGGCXXX^ 

TAAGGTCGCTX::ACAGAGa3GCTITACATCGGGGGaX<^^ 

GCAGAACTGCXjQCrATCG<XGGTGCCGCXXXjAGCXK3TGTACTC 

OKnAATACXXnCACATGTrACTIGAAGGCXXXJrGC^^ 
10 CTCCAGGACTGCAa3ATGCKX}TATGCXK}AGACGACCTIXn^ 

GOKXKKKjACCCAAGAGGAaSAGGCGAGCCTACGGGCXnT^ 

CTAQATACrrcroCCmX!CrroGGGA(XXXKXX^ 

GATAACATCATGCnXTXX:AATGTGTCAGlXXK:GCAa3ATGCATCTGGCAAAA(^ 

GTGTACTATCrcACCCGTGACCXXA(XACCCCC^^ 
15 CAGCTAGACACACrcCAGTCAATrcOGGCTAGGCAACATC^^ 

CACXJITGTGGGCAAGGATGATCXnXjATGA(nX::ATTI^^ 

AGGAACAACITCAAAAAGCCCTAGATIGTCAGATCTA(XK^^ 

TGAGCCACTTGACCTAOCrcAQATCATTCAACGACnXXATGG^^ 

CACnXX^ATAGTTACICTCCAGGTGAGATCAATAGGGTGGCTrcATGCC^^ 
20 A(nTGGGGTA(XGCCOTGCX}AGTCTGGAGACATCXKK^ 

AGGCTAClXmXX:AGGGGGGGAGGGCTGCX:ACITGTGG<:^ 

GGGCAQTAAGGACCAAGCrcAAACrcACnOCAATCCX^ 

TITATCXIAGCnXjGTTCXSTroCTGGTrAC^ 

TCTCXjTG<X<XrACXXXXKn"GGTrcATGT^ 

25 aggcatctatctackxxx:aa«x?atgaacxkkk^ 
aggccatxxtcttttttkxxntttt^^ 

TTTTT(nXXTITITITTKXrrc^^ 

GCXXnAGTQ^CGGCTAGCnGTGAAAGGTCXXnGAGC^^ 

CroATACTGGCClXnCIGCAQATCAAGT 

30 

SEQ ID NO:9; Nucleotide sequence of DNA clone of HCV adaptive Tq>licon wfaere 
nucleotide changes are in lower case and bigJilisJited in bold 

35 G<XAGC(XCCX3ATrGGGGGCX3ACACnPCCACCATAGATCACIt^ 
TACrcriCITCACGCAGAAAGCXnCTAGCX:^^ 
CCriXXAGGAaX00(XnxXXXK3QAQAG0CATA^^ 
CA(XGGAATTGCCAGGACGACCXKKnCCTTKnTGGATCAAC^^ 
GGAGATTIX3GGOGTG<XCXXXKX3AGACrGCTAGOCX3AGTAGTGTKKK^^ 

40 AAGGCCITGTXXJTACIXKXJKMTAGGGTGC^^ 

AGACXGTGCACXATGAGCACXJAATCCTAAACXZrrCAAAGAAAAAC^^ 
GaATGATTGAACAAGATGGATKK^ACXK^AGGTrc^^ 
GGCTATrcGGCrATGACTGGGCACAACAGACAATCXjGCIXKTC^ 
GTTCCGGCnxnCAGCXK:AGGGGCGCm3GTICITI^^ 

45 GGTGCOJraAATGAACTGCAGGATOAGGCAGCGCGGCrATCmX^^ 
AOSGGCXJITCXTraCGCAGCI^^ 

ggckkh'atkkkkxsaagtgcxxkkkm^ 

tgcxx3AGAaagtatcx::atc:atggctgatgcAatgcxkkxk3^ 

cxxk]k7racxn<kx:cattcga(xaccaag<:x3aaacatcgcatcgagc^ 

50 ACTCGGATGGAAGCXXKnCITGTCGATCAGG^ 
GGGCIXXaKX:AGCX»AACIXnTtXi^^ 
GAGGATCTCGTa3TGACXX:ATGGCGATG<XTGCITGC^ 
ATGGCXXKnTITCrKK}ATrcATCGA<nxnXK3^ 
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TCAGGACATAGCGTKKXTACCCGTGATATTtKnXS^GAGC^^ 
GCTOAOXJCTrCCTCGTGCITrACGGTATCXKXXKnC 
CTICTATOKXJrrCTrGAOGAGTIXjrTCTC 
CrCTAGCGGGATCAATTCCGOXXTCimnmXXXXm^ 
5 AAGCXGCTKMJAATAAGGCCGGTGTGCGTITGTCTATATGTTAIT^ 
TGCX:GTCmTGGCAATGTGAGGGaXXKjAAA(XTGGCXXnX^^ 
CATrcCTAGGGGTCTTT(X(X]TCTCXKX:^^ 

GTGAAGGAAGCAGlTCCTXnXKjAAGCTIOTGAAGACAAACAACGTC^ 

AC(XTITGCAGGCAGCGGAACX:(XCCACXTroGCGACAGGTGC^^ 
10 AG<XACGTGTATAAGATACAOCTGCAAAGGOGGCACAACm^ 

TGAGTIGGATAGTTGTGGAAAGAGTCAAATGGCrcri^^ 

GGGGCraAA(jGAlXKXX:AGAAGGTACCCCATim'ATGGGATCTCA^^ 

CX3GTGCACATQCITrACATQTGTTTAGTC»AGGT^^ 

GAACmaKKKJACrnXKmrnXTTTCAAAAACA 
15 TACGGCCTA<J[XXCAACAGA<XKX5AGGCXJrACITGGCT^^ 

ACAGGaXKK5ACAGGAA(XAGGT0GAGGGGGAGGTCCAAQTGGTCntXA(X^ 

ACACAATCirrcXJKKjCGACXiriGCGT^^ 

GTGCCGGCTCAAAGACXXTIGCXXKKXXAAAGGGCXX^AATCA^ 
CCAATGTGGACCAGGACOt^GTCGGCnXKK^AAGCGaXCCa^^ 

20 GACACCATCCAOCTGCXSGCAGCmKiACCTrrACOT^ 

GTCATiaXKnXKXKXGGCXKSGGCGACAGCAGGGGGAGCXT 
CXXGTCrcCTACITGAAGGGCnXJITCGGGCXSGTC^ 
ACGCTGTGGGCATXJrnCGQGCTGCCGT(n(X:A^ 
GGACTTItnACXXXnXXSAGlXnATGGAAAOCACTATGCGC^^ 

25 GACAACTCGTCmnmKXX:GTACm:AGACATIXXAGGTGG^ 
CXXXn-ACTCGTAGCGGCAAGAGCACrAAGGTGOXKKnGC^ 
GGTATAAGGTGCnGTCOGAA(XXXmXXnXXKX3GCCACCCT 
GTATATGTCTAAGGCACATGOTATCCACCCTAACATCAGAACCGGGGTAAGGAC 
CATX^ACCACGGGTXKXXXXIATXIACGTAC^^ 

30 GGTCGTIXKJKJrGGGGGaKXTATGACATCATAATATGro 
CTGACnXJGACXIACTAlXXJlXKiKKi^T^^ 
CroGAGC»CQACrCXjrCX3TGCiaKX:A(XXKT^ 

GCCACATCCAAACATCGAGGAGGTCGCTCTGTCCAGCACIXKjAGAAAT^^ 
TATGGCAAAGCX:7VTCm:AT0GAGAa::ATCAAGGGGGGGAG(^ 
35 GCCATTOCAAQAAGAAATGTQATQAGCIXXKXXKXJAAGC^^ 
CAATGCIGTAGCATATTACCGGGGanTCATCrrAT^^ 
GACGTCATIGTdGTAGCAACGGACXKnxnV^TGAC^ 
ACrcAGTQAT03ACTGCAATACATGTQTCA(^^ 
(XmAanTCACCATIGAGACGACGACmiXKXACAAGACGCXKnXS^ 

40 cagcggcxjaggcaggactggtaggggcaggatgggcatttacaggtitgt^ 
cx:aggagaa<xigc(xtcxkkk:atgtixx5ati^^ 

ACXKXKKKn*GTGCnTXK?rAC»AGCTCACGCmKX»AGAC^^ 

GGCHTACXTAAACACACCAGGGTTGCmntnXSailAGGAC!^ 

GAGAGanxnTTACAGGCXnt^ACa^ACATAGACXKX^CATTIXn^ 
45 AGCAGGCAGGAGACAACrrOOCXrrACCIGGTAGCATAC^ 

CXiAGGGCTCAGGCTOCACXnaiATCXJTGGGACCAAATGI^^ 

GCTAAAGCCTA(XKnX3CACXK3GCCAAaKXXXnrGCT 

CAAAA(^AGGTTACrACCACACACCa:ATAAa:V^ 

CGGCTCACCTGGAGGTCXnXIAOGAGCyV^COXKKjl^^ 
50 CAGCTCrrGGCCX3a3TATIX3COGACAACAGGCAGCG^ 

CATCrTGT(X:GGAAAG<XXK]KX:ATCATrc<XGACAGGG^ 

TTCGATGAGATGGAAGAGTGCXKXnCACACCIXXICTrACATCXjy^ 

AGCTXXXXX3AACAATrcAAA(lVGAAGGCL\ATCXK^^ 
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AGC^GOKjAGCKnXKnXKnXXCXnGGTGGAAT^^ 
anrCIXKKKXSAAGCATATGTGGAATTTCATCVVG^^ 
CJnXJTCCAClUKKXTGGCAACXXXXX^GATAG^ 
TCTATCA(X:AGCCXX3CTCACX:ACCX:AACATAaX^ 
5 GATGGGTGGCXXKXX:AACTIXKnx:CnOXAGCXK^ 

GGCATCGCTCGAGCGGCroTIX3GCAGCATAGGCCTI^^ 
TTmK]K:AGGTrATGGAGCAGGGGTGGC:AGGCGCX3CTCGTGGC(^^ 
GAGCGGaiAGATGCCOXXACXX3AGGA<XTGGTrAA<XTACT^ 
TCCC<:nX3GCGC<XTAGTCXnXXKKK3 
10 IGGGCXXAGGGGAGGGGGCIGTGCAGTGGATGAACOKKnXJATAGCGT^^ 
CGCGGGGTAACX:Aa3TCTCCCXX:ACXK:ACTATGrra^ 
AaSTGTCACTCAGATOnXITCTgGTCriTACX:^ 
ACX^AGTGGATCAACGAGGACKKnXXACGCXilATGC^ 
ATGTTTOGGATTGGATATGCACCGTGTIGACnGAT^ 
15 CAAGCIXXIXKXXKXjATTGCXXKKSAGIXXSXT^^ 

AAGGGAGTCrGGCGGGGCGAaKX:ATCATGCAAA<XAan^ 
CAGATCACX3GGACATGIX3AAAAACXKmCCATGAGQATOGTC^ 
TGTAGTAACACGTGGCATGGAACATrcaXATTAAOKOT^ 
GCACXKXXrrCXXXXKKXKXAAATTATTCrAGGGC^^ 
20 GGAGTACXjTGGAGGTTACXKXSGGTGGGGGATITCCACTAC^^ 
CACraACAACQTAAAGTGO0CmxnCAGGTIXXX3^^ 
GTGGATCKSGGTGCGGTTGCACAGGrAaKnXXAGCGT^^ 
AGGAGGTCAGmtXnGGTCGGGCTCAATC^^ 
ATGK:X3AGaXX3AA<XXK}ACGTAGCAGTGCTCA(nTCX:ATGC^ 
25 CACATTACXKSCXKiAGACGGCTAAGOGrgGGCIXKKXAGG^ 
TGGCCAGCn€ATCAGCrAGCX:AGCnX3TCrGCXXX^IT^^ 
TACXX:GTCATGACIXXXXXK3ACXKnXJA<XTCATCGAGGa:A^ 
C^GGAGATGQGCXKKJAACATCACCXXKXfTGKSAGTCAGAAAATAAGGTAG^ 
TTGGACnXnTKXSAGCXXKnXX^AAQCXKJAGGAGGAT^ 
30 CXKKXKSAGATOCroCGGAGGTCCAGGAAATimrim 

CACXKXXXKSATTACAACCOXXACTGTrAGAGTan^^ 
OXnXXAGTGGTACAOGGGTGTCCATTGCXXKX^^ 
CXJrCCACXjGAGGAAGAGGACGGTrrGTCXn'GTCAGAATCT 
TCKKXKiAGCKXKX^ACAAAGACXnTCXKK^ 
.35 GCGGCACGGCAACGGCCIUItXTGACXIAGaXJrCXICACXj 
CXXSAaSTTOAGTCXn'ACnPCCrrCCATGCCXXX^ 
CGATCTX:AGa}ACGGGTCrmKnCTA<XGTAAGC^ 
CGTCTGCnXK:TCGATGTCXjrACACATCKjACAGGCXK^ 
GCXKSAGQAAACCAAGCTOCXXATCAATQCACIGAGC^ 
40 ACAACTTOGTCTATGCnACAACATCri^^ 

TCACCTITGACAGACTCCAGGTCCTGGACX3ACCACTA(XGC^ 
GATGAAGG<X3AAGG0GT0CACA(nTAAGGCrAAACr^ 
Cnt3rrAAGCTOAOG(XXXX:ACATTCX3^^ 
GAa}T<XXK}AA<XTATCCAGCAAGG<XGTrAACX:ACATC^^ 
45 ACITGCKKjAAGACACTGAGACACX:AATTQACACCACCATCATGGCAAAA^ 
AGGTmCrroantXIAACXIAGAGAAGGGGGGC^^ 
ATTCCTAGATITGGGGGTItXnXnXiTGCGAGAAAATGGaXTITAro 
TCX^ACXXTCajTCAGGOCXnGATGGGCIC^ 
ACAGCGGGTCX3AGTTCCIX3GTGAATGCCIGGi^ 
50 CITOKT^TATGACAaXXKnXjTlTK^ 

GTrGAGGAGrrcAATCTACX:AATGTrrGTOACTIXKK^ 

TAAGGTCX3CTCACAGAGCX3GCTTTACAT(XKK3GGCCCCC^^ 

GCAGAACTG0GGCTATCXKX3GGTG0CGCGCGAGaKnGTAC^ 
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CXKJTAATACXCrcACATGTTACrrGAAGGCaKn'GCXKKXTO 
CTCCAGGACKK:ACGATGCKXnATGCGGAQAOGA(^^ 
GaXX3GGGAOOCAAGAQGACGAGGCGAGCCTACXK3GCCnTCACGGAGQCTATGA 
CrAGATACTCTG<X(X0CCKKKKjACCCX3CXX:AAA<XAG^^ 
5 GATAACATCATGCTOCTCCAATGTGTCAGTCGaSCAC^^ 

CAGCTAGACACACTCXJAGTCAATTCCTGGCTAGGCAACATCATCATGTATGCGC^ 
CACCnrGTGGGCAAGGATGATOCTGATGACrcATITCTKn^ 
AGGAACAACrreAAAAAGCXXTAGATroTCAGATCTACGGGGOCT^ 
10 TGAGCCACnTGAOTACCTCAGATCATTCAACGACTCCATGGCOT 

CACIXXATAGTTACIUKXJAGGTGAGATCAATAGGGTGGCrrCAT^^ 
ACTTXK3GGTAaXKXXrrroC»AQTCrroGAGA 

AGGCTACTGTC<XAGGGGGGGAGGGCnX3CX:ACTIX3TGGCAAGTA<Xr^^ 
GGGCAGTAAGGACX^AAGCTCAAACTCACTCCAATCCXXKK^ 
15 TTTAlXXAGCTGGTKXnTCH:r[XKmACAGCGGGGGAGACATATAT^ 
TCTCGTC<XOGACCCCGCTGGTrcATGTGGTG(XTAC^^ 
AGGCATCTATCTACrCXXXI^CX^ATGAAaKKK^^ 
AGGCCATCXnOTITTm^^ 

TriTmunxTiTiTiTrrocrrLTiTiTnccTmum 

20 TAGCOnAGTCA0GGCTAGCTGTOAAA.GGTaxn^ 
ItKnXJATACIXKKXnxnCTGCAGATCAAGT 



SEQIDNO:10 : Nucleotide sequoiceofDNA clone of HCV adaptive lepliooaV.^i^iae 
25 nucleotide change is in lower case and UghligJitBd in bold 

G<XAGCax:CGATKKKKKXX5ACACTCCACCATAGATCACrc<X 

TACnBTCrrcAOKJAGAAAGCGTCrAGCXATGGCCOT 

CCTCCAGGAa:CCa:CTCCXXK3GAGAGCCATAGTGGTCrGa3^ 
30 CAOXjGAATTGCCAGGACGACCXKKntXTITCTKKiATC^ 

GGAGATTKK3GantKX<XCGCXjAGACnX3CTAGCCGAGTAG^ 

AAGGCXTIXnXKn-ACnXjCCroATAGGGTQCITGCGAGTGCXXXXMKJAGG^^ 

AGACXXnXK:A(XATGAGCACGAATCCTAAACCrrCAAAGAAAAACX:AAAGG<^^ 

GOCATGATroAACAAQATGGATTQCACXK^AGGTIXTItXXKS^ 
35 GGCTATTCGGCTATGACIGGGCACAACAGACAATtXKKnXKn^ 

GTXXXCGCKjTCAGCXSCAGGGGCGCXXXKjTIUI^^ 

GGTOOCXJTGAATGAACroCAGGACXSAGGCAGCXKXKKTATOGT^^ 

AOSGGCGTIXXTroCGCAGCrGTGCTCGACGTK^ 

GGCTGCTATIXKKKXJAAGTGCXXKKKK^AGGATCnX^ 
40 TGCCGAGAAAGTATCCATCATGGCTGATGCAATGKXKSCXKK^^ 

Ca3GCTA<XnX3CO:ATKX5A(XACCAAGCXjAAACATC^ 

ACI«K5ATGGAAG(XGGTCTrGTCX3ATCAGGATGATCT^^ 

GGGClXXKXK:CAGCCGAACroTTaKX:AGGCIXL\^ 

GAGGATCTCQItXmiACOCATGGCGATC<XTGCTnKXX} 
45 ATGGCXXKJITrTCrKKSATTCATra 

TCAGGACATAGCXmGGCTA(XCGTOATATIXK7roAAGAGCITGGCGGCX3A^ 

GCIGACCXKnrCXITCGTGCTrrACXK^^ 

CTTCrATCXKXTTCTraAOOAGTICTICro 

CrurAGCGGGATCAATTCCGaXCltrn^ 
50 AAGCXXSCITGGAATAAGGaXKnxnXXXnTlxnCTATAT^ 

TGCCCnrrrnX3GCAATXnQAGGG<XXX3GAAA(Xn^^ 

CATTOCTAGGGGTCTITCXXXnCIXXKX:;^^ 

GTGAAGGAAQCAGTKXTIXnGGAAGCnTCrrTGAAGAC^^ 
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AaXTrnK:AGGCAGCGGAACCCCXXACX7KK^ 

AG<XA<Xm3TATAAGATACA(XTGCAAAGCK:X3GCACAA(X(XAGTGCCACGTO 
TGAGTKKjATAGTIXmiGAAAGW^GTCAAATGGCKnCC^ 
GGGGCrGAAGGATGax:AGAAGGTAaXX:ATK3TATC(W 
5 CGGTGCACATGCirrACATGTGTITAGTCGAGGTTAAAAAACXrTCTAGC^ 
GAACXJACGGGGACXnGGTriTCCriTGAAAAACACXfATAATACC 
TACXKKXTACrCXXJAACAGACXKXJAGGanACTTGGC^^ 
ACAGGaXXK3ACAGGAA<XAGGTCGAGGGGGAGGTCCAAGTGGTCT^ 

ACACAATCirrcmKSCXMcxnxxxncAAi^^ . 

10 GTGCCGGCTCAAAGACXXnTXKXXKJCXXIAAAGGGCXXIAATC^ 
CX:AATGTGGACX:AGGACXJrCGTCXKKnGGCAAGaKXX:<^^ 
GACACQ^TGCACCTXKXK3CAGCnx:GGA<XTITACITGGT^ 
GTCATKXXKmXXKXXSGCGGGGOJACAGCAGGGGGAGCCTAC^^ 
CCCGTCTCXTTACnTGAAGGGCKJITCGGGCGGT^^ 

15 ACGCIXSTGGGCATCrnCGGGCroCCGTGT^ 
GGACriTroTACXX3GTCX3AGTCrrATGGAAA(^ 
GACAACTCXnaXXTOOGGaxnAOXK^AGACATICXIAGGTGGC^^ 
CmnACTGGTAGCXKKIAAGAGCACTAAGGTGCJOGGCro^ 
GGTATAAGGTGCTIXnajrQAAC»XriXXXnx:GC^^ 

20 GTATATCTCTAAGGCACATGGTATOGACXCTAACATC^ 
CATCACCACXKKnGCmx:ATCACGTACroCAC^ 
GGTGGTKKriX:nXKKKKKX5(XTATGACAT^ 
CTGACTCGACCACTATCCIXKSGCATCXKKACAGTCCT 
CnXKJAGCGCGACrCXiTCGTGCrC^^ 

25 G(XACATCK1\AACATCGAGGAGGTGGCTCIX3TCX:AGC 

tatggcaaag<xatcax:atcxjagaa::atcaagggggggaggca<xtcat^ 

gccattcx:aagaagaaatgtoatcagckxkxxkx3aagctgtcc^^ 

caatgctx3tagcatattac(xkkkkxntgatgtatcx:gtca^ 

GACGT(^TraTCGTAGCAACXK3ACXK7IUrAATGACGGGCTITACCGGra 
30 ACTCAGTGATa3ACIGCAATACATGTGTCA(XX:AGACAGTC^ 
CXX!GAC5CTrcA<XATTCAGACQACGACX3GTCO^ 

CAGCXKKXJAGGCAGGACroGTAGGGGCAGGATGGGCATTTACAGQTrr^ 

(XAGGAGAACXKK:(XTCXKKX:ATGTimATIXXnXXKn^ 

ACGCGGGCIX3TGCriX3GTAOGAGCrcAaKXX^ 

35 GGCITACCTAAACACACCAGGGTKKXXXnUKMXAGGAiXAT^^ 
GAGAG(Xn'CnTrACAGGCXnx:ACCX:ACATAGAaKXX:ATIT^^ 
AGCAGGCAGGAGACAACTIXXCCTA(XrrGGTAGCATA(XAGGCrAa^^ 
CX:AGGGCTCAGGCTCX:A<XTCX:ATCXnXK3GAa:AAATGTGGAAC^^ 
GCrAAAGCCTACXKJIGCACX3GGCX:AACX3C«XrroCIOT 

40 CAAA^^CGAGGT^ACTA(XACACA(XXX!ATAAa:AAATACATCATGGC^ 
CGGCnX3ACCnXK3AGGTCmx:ACGAGCACX^IXK3GTG(^^ 
CAGCKnXKKXXKX3TATIGCXnXjACAACAGGCAGCXnX^^ 
CATCrTGTXXX3GAAAG0CXKKX:ATtATIXXCQACAGGGA^ 
TimATGAGATGGAAGAGTGCG«nXZACACCraXTrACATCGAA<^ 

45 AGCrOSCCGAACAATTCAAACAQAAGGCAATCX^ 

AGCAAGCXKSAGGCTCCTGCrOCXXnGGTGGAATCCAAGTGG^ 

CCTIUTGGGCXJAAGCATATGTGGAATITCATCAGCGGGATACAATAm 

CTrGTCX:ACrClXX:croGCAACXXCX3CGATAG<^^ 

ix:rrATCACx:AGcmKncA(XACxxAACATAC«:ri^^ 

50 GATGGGTGGCCGaXAACITGCIXXnXXCAG<^^ 
GGCATCXjCnXKSAGCGGCroriXKKlAGCAT^ 

TTITGGCAGGTrATGGAGCAGGGGTGGCAGGCXKXHn-CGTGGCCITrAA<^^ 
GAGCGGCXSAGATXKXXnCXIACOjAGGACXnGGTr^ 
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TCmnt3GCXKXXn'AGTCGTOGGGGTCXm3TGa^ 
TGK3GCXX:AGGGGAGGGGGCIX3TGCAGTGGATGAA(XXKKn" 
CGCGGGGTAACCACmUIXXm:AaK;ACTATGl<KXnX3^A^ 
ACGTOTCACTCAGATCXTrcTCTAGTCrrACCAT^^ 
5 ACCAGTGGATCAACGAGGACIXXnXXACGCCATGCimSGCI^ 
ATGTITGGGATrGGATATGCAaKSimTGACKmTrcAAGAOT 
CAAQCnXXritKXXKXJATKKXXXiGAQTCXXXTIUI^^ 
AAGGGAC?KnXKKX3GGGa3ACXKK:ATCATGCAAACCACCnXKXX^ 
CAGATCACCGGACATGTGAAAAAOGGTrcCATGAGKSATCGTGGGGiXTAGGACC 
10 TGTAGTAACAOnXKKlATGGAACAlTCCOCATTAACGaSTAC^ 
GCACGCCXnCCCXXKKXKXLVAATrATTCTAGGGCGCI^ 

GGAQTACGTGGAGGTTACXXXKjGTGGGGGATTrcCACrAC^TGACGGGCATGAC 
CACroACAAOGTAAAGTGCCX»TGTCAGGTKXXKKX:CCC^ 
GTGGATGGGGTGCXKmGCACAGGTACXKnPCX::AGCXn^ 
15 AGGAGGTCACATKXnXKnCXKKKnX^AATCAATACCI^^ 
ATGCGAG(XXX}AACaK3ACGTAGCAGTGCrcACrrcC^^ 
CACATTAOQGCXK5AQACX3GCrAAGCmAGGCIXKKXAGGGGATCIX^^ 
TGtCX:AGCrcATCAGCTAGCX:AGCTXntlt^^ 

AcoxrrcATGAcnxxxxK5AaKnx3A(xri^ 

20 AGGAGATGGGCXKKSAACATCACXCGCGTGGAGTCAGAAAATAAGGTAGTAAm 
TGGACIXnriaSAGaXKnXXAAGCXiCSAGGAGQATOAQAGGaAA^ 
CGGCGGAGATOCIXKXK3AGGT<XAGGAAATiaxrrcX3AGC» 
CACGCXXXKSATTACAACOnXXACroTrAGAGTOCn^ 
OXJKXy^GTGGTACACXKKJroTCCATKKlCXKXJI^^ 

25 CCTCX3A<XGAGGAAGAGGACX3GTrGTCCIGTCAGA^ 

TGGCGGAGCim:CACAAAGA(XTnXKK:AGCTCX:GAATCGTCG 
GOGGCACXKICAACGGCOXnCXnXSACCAGCCXTIXXG 
COGACGTrGAQTCGTACianC!CATGaXXmn^ 
CGATCrrcAGOGACXK3QTCTItK3TCTA<XGTAAGCX3AGGAGGCT 

30 anXTTGCnXKnCGATGTCCTACACATGGACAGGCXKXXnC 

GCGGAGGAAACXIAAGCroCXXATCAATGCACroAGCAACnXnT^^ 
ACAACTTGGTCnATGCTACAACATCritXK:^ 
TCAOCTITGACAGACKKLVGGTCXnGGAOTACCAC^^ 
GATCAAGGa3AAGGCGTCX:ACAGTrAAGGCTAAACTIUrATCX^ 

35 CTGTAAGCTGACXXXXXXIACATnXKXXIAGATCT^ 

GACGTCCGGAACCTATCCAGCAAGGCXXnTAACCACAT^^ 
ACTKKJIXKSAAGACACTGAGACACCAAITGACACCACCATCAT^^ 
AGGlTITCriX3CGTCCAA<XAGAGAAGGGGGG<XGCAAG<XAGCI^^ 
AT«XX:AGATTIGGGGGTrCXm3TGTGCGAGAAAATGGC^^ 

40 T0CAcxx:rrccxnx:AGGaxrK3AT^^ 

ACAGOSGGTCGAGTTOCIGGTGAATGCCnGGAAAGCGAAGAAATGC^ 
CriTCGCATATGACAC!CX30CTQTITIX}AC^^ 
GTTGAGGAGTCAATCTACCAATGTroTGACnTGGC^^ 
TAAGGTCXKJrCACAGAGCXKK:TITACATCX3GGGGa:C^^ 

45 GCAGAACIXKXKKnATCXKXXKnXSCOGCGCXSAGCGGTGTAC^ 
CGQTAATACCCTCACATGTrACTraAAGGCXX^ 
CKX:AGGACrGCACX3ATGClXXnATGCXK}AGAC»AC^^ 
GaK;GGGGAax:AAGAGGA(XJAGGCGAGCCTACXKKK:C[T^ 
CTAGATACKnX3aXCm:TGGGGACXXXKX:CAAACCAGAATA<^ 

50 GATAACATCATGCTCCItXIAATGTGTCAGT^^ 
GTGTACTATCTCACCXXJroACOCXl\OCA(XXXX^ 
CAGCTAGACACACItXAGTCAATTCOXKKn'AGGCAACATCATC 
CACCITGTGGGCAAGGATGATXXTGATGACTC<VTTT^^ 
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AGGAACAACTTGAAAAAGCCCTAGATrGTCAGATCTACGGGGCCTCTrACTC^ 
TGAG<XACTTGACCrAanx;AGATCATTCAACGACIXXATG<^^ 
CACTCCATAGTTACICrCCAGGTGAGATCAATAGGGTGGCrrcATGCC^ 
ACTltXKKn'ACCGCCCITGCGAGTCIXKjAGACATCGGGGX^AGAAG 
5 AGGCTACKSTCCCAGGGGGGGAGGGCTGCJCACTraTGGC 
GGGCAGTAAGGACrAAGCTCAAACTtACrCCAATCOD^^ 
TTTATCTAGCrGGTTCGTTGCIXKmACAGCGG^ 
TCTCXnXKXXXjACXmSCTGGTrcATGTGGTGCCrACTCCTAC^^ 
AGGCATXJTATCTACTOCCCAACCGATGAAOGGGGACXnAAACAC^^ 

10 AGGcxATCCTGiniriiccciiriiiriiriuiiiiriiiiriiiiTiTiTiiiiiiiT 
TirrmcrcciTiTTrrTT^^ 

TAGOXTAGTCACGGCTAGCTOTGAAAGGTCXXmJAG 
TGCKSATACTGGCOCTCrGCAGAtCAAGT 

15 

SEQIDNO:!!: NS5A gene of DNA clone of HCV adqitive replicon IV, where nuckotide 
change is in lower case and highlighted in bold 

1XXGGCTCGTXKKTAAGAGATGTITGGGATIX3GATATGCAC^^ 

20 TCAAGA(XrrGG<nXXAGTCCAAG<nanXjCXX3CX3ATrG<^^ 

(nCATGTCAACGTGGGTACAAGGGAGTCrGGCCGGGCGACGGCAT^^ 
CACOXKXX:ATGTGQAGCACAGATCACXX3GACATGTGAAAAACGGTrrc 
GATC5GTGGGG0CTAGGACXntnAGTAACAamKX:ATGGAA 
Ga3TACACX:A(XIGGCXXCTGCACGCXXnCCXX:X3GCGCCAAATrAT^ 

25 TGTGGCXKKmKKnGCTOAGGAGTACXnXHjAGGTrACGaK^^ 
ACTAOGTCACGGGCATGACXACTGACAACQTAAAGTGCCmrcTC^ 
aX^CGAATTCTTCACAGAAGTGGATGGGGTGCXKjTKKJACAGGTAC^^ 
TGCAAACmTrcCTAOKKJAGGAGGTCACATIXXrroGTaJGGC^ 
TtKmXKKnCAC^GCKXX^ATGCXSAGCCXXSA^ 

30 CATGCTCAOIXSACmirraXACATrAaK^ 

AGGGGATUIXXmXTgCTTGGiXAGCrCAT^ 
CITGAAGGCAACATGCACrACCX:GTCATGACnXX:CCGGA<^^ 
GCX:AAanXXTGTGGCGGCAGGAGATGGGCGGGAACATCAC^^ 
GAAAATAAGGTAGTAATmXKSACKnTKXSAGOXKnt^^ 

35 GAGAGGGAAGTATCCGTTCCXKSCGGAGATOCTGCXSGAGGTCC^ 
CGAGCX}ATGC!CX:ATATCGGCACGCCCGGAlTACAACCmx:^ 
GGAAGGAC0CXK3ACTACXmXCKX:AGTGGTACACGGG^ 
CAAGGCXXCTCCGATACX:AaJKXACGGAGGAAGAGGAC^ 
ATCTAC)CXnxnxnTCTGCXnTGGCXK3AGC^^ 

40 GAAT0GTaKKX!GTCXSACAGCXKK:A0GGCAACX3GC^^ 
ACXSAOKKXSACXKXKSGATCOGAanTCAGTOGTA 

GGGGGAGCCGGGGGATCXX^GATXnCAGCGACGGGlXJITGGTCTACCXff^ 
GGAGGCTAGTGAGGACXnanCrGCrGC 

45 

SEQIDNO: 12 ; NS5A goie of HCV adaptive iiq>licon HI, where nucleotide diange is in 
lower case and hig^^ited in bold 

TCCXKKrrcGTGGCTAAGAGATGTrrGGGATKKJATAT^^ 
50 TCAAGA<XnXK3CrCX:AGTCCAAGCTCCn^ 

CICATCTCAACXmKKn'ACAAGGGAGTCrrGGO^^ 

CACOX3CO:ATGTGGAGCACAGATCACCGGACATGTGAAAAACGG1^^ 

GATCX?KKKKKXJrAGGACXJrGTAGTAACACGTGGCATGGAACAT^^ 
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Ga3TACAa:Aa3GGCXXCK3CACXKXXnXXXXX3GC^^ 
TGTGGCGGGTGGCroCTGAGGAGTACGTGGAGGTTACGCGGGTGGGGG^ 
ACTA(X3TGACGGGCATGA(XACrGACAAa5TAAA(JIX3<XX:GTG^ 
(XmSAATTCTTCACAGAAGlKKSATGGGGTGaKmX^ 

5 TGCAAACmnnJTACGGGAGGAGGTCACAlTCXn'GGTCXKKXnCAA 

TGGTTGGGTCACAGCTC(XATGa3AG<XX:GAACCGGACGTAGCAGTGCTCAC^ 
CATGCrcACCGACa:CTCCX:ACATrACGGCGGAGACGGC:TAAGCGTAGGCrGGCC 
AGGGGATCTCCCCXCcCKJITGGCCAGCrcATC^ 
CITGAAGGCAACATGCACTACCCXntlATGACIXXXXGGAC^^ 

10 GiXAACCnXXnXxTGGCXIKXIAGGAGATGGGaKKJAACATC 
GAAAATAAGGTAGTAATmGGACTCrmXXiAGOC^^ 
GAGAGGGAAGTATCXX?rKXXK3CX3GAGAT(X7roaKjAGG^ 
CGAGOJATGCXXATATGGGCACGCmKJATTACAACCCrCCAC^^ 
GGAAGGAOXXjGACTACGIXXCTCCAGTGGTACACGGGTGTa:^^ 

15 CAAGGCXXXnXXX5ATACCAOCKX:AOGGAGGAAGAGGAa3GT^^ 
ATCTACXGTGTCITCIGCXnTXKKXXiAGCn^ 
GAATCGTCGGCOGTCGACAGCXKKlACGGCAAaSGCXJ^^ 
ACOACGGCGACX3CGGaATCXXM.CX3TK5AQTa3TA(n^ 
GGGGGAGCaKKKK3ATO(XX5ATCrrcAGCGAOGGGTCI^^ 

20 GGAGGCTAGTGAGGACmtXniCTGCrGC 



SEQIDNO:13 : Nucleotide sequence of DNA clone of HCV adi^tive leplicon VH, wlieie 
nucleotide change is in lower case and luj^j^iled in bold 

25 

GCCAGCCCXX»ATTGGGGGCGACACia:ACX:ATAGATCAC:^^ 

TACrGTCTTCAaX:AGAAAG<X?IUrA:GCX:ATG« 

COaiAGGAtXmXXnXXCGGGAGAGCCATAGTCGTC^^ 

cacoksaattcccaggacgaccxksgtctnt^^ 
30 ggagatttcggogtg<xcocgcgagaclxkrrag(xgagta 
aaggccntgtggtacigongatagggtgcitgcxsagtgcc^^ 
agacx:xjixk:acx:atgagcacgaatcctaaacox:aaagaaaa^ 

GCX:ATGATroAACAAGATGGATTGCAaK;AGGTIxm:aKK^^ 

GGCTATTCXjGCTATGACIGGGCACAACAGACAATCGGCK^^ 
35 GTIXXGGCTGTCAGCXKlAGGGGaKXaKmC^^ 

GGTGOXTGAATGAACTCCAGQAOGAGGCAGCCaSGCrATCmGGC^^ 

ACXSGGCXmCCTIGCXKAGCraTXKnXXSAO^^ 

GGCTOCTATKKKKX3AAGTGCX»GGGCAGGATClXXnxnx::AT^ 

TGCXX3AGAAAGTATOCATCATGGCK5ATGCAATGOGGCGGCTG^ 
40 CCGGCrACCroC0CATIXX5ACX:ACCAAGCGAAACATCXK^ 

ACTCCGATGGAAGCCGGTCTnHXDGATCAGGATGATCrrGGAOGAAG 

GGGCIXXKXKX:AGOCGAACIGTIXXKX:AGGCrcAAG 

GAGGATCTCGTCGTGACXXATGGCGATCCCTGCT^^ 

ATGGCCXXTirrCOXKSATTCATCGACIXnGGCCGGCT 
45 TCAGGAGATAGCGTIXK]K:TACXXXn-GATATIXKJ^ 

GCTGACCGCTrCXnx:mX3KJrTTACX3GTATCGCC^^ 

CTTCTATCXKXTnnTGACGAGTIOTCrraAC^ 

CICTAGCGQGATCAATTCXXKXXXrrcritXX^^ 

AAGCCGCXTGGAATAAGGOXJGTGlXKXriTIX?^ 
50 TGCmKnrmKK^TGTGAGGGOXGGAAACXr^^ 

CATIXXTAGGGGTCrrrcm^KnaKXIAAAGGAATGC^ 

GTGAAGGAAGCAGTKXnCIXjGAAGCITCriTGAAGACy^ 

A(XCTT1XK:AGGCAGCGGAA<X0CXX:A(XJIX3GCGACAGGTGCC^^ 
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AGCCACGTGTATAAGATACAanXKJAAAGGOGGCACAACCXX^AGTGCCAO^^ 
TGAGTTGGATAGTTOTGGAAAGAGTCAAATGGCTCTCCTrcAAGC^ 
GGGGCroAAGGATGCCCAGAAGGTACmiATIxnATGGGATCTQA 
. CXK3TGCACATGCnTrACATGTGTTTAGTC»AGGTr^^ 
5 GAACXACGGGGACGTGGTrrTCCirraAAAAACAaSATAATACCATGGa^^ 
TAaKK:CTACT(XX:AACAGACGCGAGGCCTACTTCGCIXK:Al^ 
ACAGGCX:GGGACAGGAAa:AGGTCX}AGGGGGAGGTa:AAGTGGTCTCCA 
ACACAAlXnTrcCIX3GCGA(XTGCGTCAATGGCGIX3^^ 

gtg<xggctcaaaga<xctkkxxkkxx:aaaggg(xcaatca^^ 
10 ccaatgtggacxiagoacoajtcggciggcaagogocxxx^o^^ 
gacacx:atgcacck3cggcagcrcgga(xtitacti^^ 
gtcattcokstgcxkxixkxxkkkkxsacagcagggggagcctac^ 

CXXGTCTOJrACITGAAGGGCTCriTCG^ 

ACGCTOTGGGCATCriTKXKKKnGCCGTGTGCACCXDGAGGGGT^ 
15 GGA<nTIXnA(Xmia3AGTCTATGGAAACCACTAl^^ 
GACAA(nXXrnXCO"<XGGQ:GTAaXK:AGACATlX^ 
CCCCTACTCGTAGOGGCAAGAGCACTAAGGIGaXKKnG^ 
GGTATAAGGTGCrTCTOCTOAA(XX:GTCCGT^^ 

GTATATGTCTAAGGCACATGGTATCGACCXTAACATCAGAACCGGGGTAAGGAC 

20 CATCACX:AaK3GTGCmx:ATCA0GrACKX:AC^ 

GGTGGTlXKrrCTGGGGGCGOTATGACATCATAAT^^ 
CnXSACTCGACCACTATCCrGGGCATOKKiA 
CTGGAGCGCGACntXnx:GlXKr[XXKX:ACOGCTACXKX^ 
GOCACAT0CAAACAT0GAGaAGGTGGCTCTGTCX:AGCAC^^ 

25 TATGGCAAAGCX^ATCXXXIATCGAGACCATCAAGGGGGGGAGGCACOCATTTTCT 
Ga:ATrCX::AAGAAGAAATGTGATGAGCTCXK:a^ 
CAATGClXJTAGCATATTACCXKjGGOCTroATG?^^^ 
GAOTTCATKntmAGCAACGGAaXnUrAATGAOGGGCTI^ 
ACTCAGTGATaSACKKJAATACATGTGTCACa^AGACAGT^ 

30 CXXGACCITCA<XATTGAGAa3ACGAC0GTG<XACAAGACGCGGTGrcAa^^ 
CAGCGGCGAGGCAGGACKKnAGGGGCAGGATGGGCATTTACAGGT^ 
a:AGGAGAAOGGCOCrcGGGCATGTrcGATlXXnCG^ 
A<XCGGGCTGTCCTIGGTACGAGCnx:AaKXX3G0C»AGAC^ 
GGCTrACX:TAAACACA(XAGGGTIGCXXX3TCriG0C^^ 

35 GAGAGCX3TCTrrACAGGCCTCA(XCACATAGAC^^ 

AGCAGGCAGGAGACAACTItXXXn'ACOXKn'AGCATACXIAGGCTACGGTGTGC^ 
CXJAGGGCTCAGGCKX^ACCrcCATOGTGGGACCAAAT^^ 
GCTAAAG<XTA0GCKK:ACXKKKX:AACGCOXnrG<nX3TATAGGC^^ 
CAAAAa3AGGTrACnAa:ACACACXXX:ATAA<XAAATACATCATGGCAT^ 

40 CGGCIGACCTOGAQGTCXnXIIACXSAGCAanG^ 

CAGCKnXSGCCGCGTATTGCCTOACAACAGGCAGCGTGGTCATIXj^^ 
CATCTKn€CXK3AAAGaXKKXATCATr^^ 

TTCGATGAGATGGAAGAGTGKXKXnCACACOCCCTrACATtXrAAC^^^ 

AGCIXXK:0GAACAATrcAAACAGAAGGCAATCXKK3?ITGCIXK:A^ 
45 AGCAAGOSGAGGCIGCroaXXXXnGGTXKjAATCCAAGTGGCGG 

CXJITCTGQGC»AAGCATATGTCGAATrrcATCAGC^^ 

CTIXnXX:ACTXnXKXnXKM:AACXXXXKX3ATAGCAT^^ 

TCTATCACX^AGCXXXKJrcAaLVCXXJAACATAaxnXDC^^ 

GATGGGTGKKXXKXXJAACITGCTanXXX: 
50 GGCATOKnXKSAGCXKSClXnTGGCAGCATAGGC^^ 

TTrroGCAGGTTATGGAGCAGGGGTGGCAQGCXKXKntXn'Gi^^ 

GAGCGGCGAGATGCXXJnXACCGAGGAiXrrGGTrAACX:^^ 

TCmnX3GCXKXCTAGTCGTCGGGGTa?IGTGCGCAG<^ 
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TGGGOXAGGGGAGGGGGCraTGCAGTGQATCAACXDGGCrQATAGCm^ 
CGCGGGGTAACCACGTCTCOC(XACGCACTATGTGCCTOAGA^ 
ACX3TGTCACrCAGATCOXTrCTAGTCTrAa:ATCACrCAGC^^ 
A<XAGTGGATCAA<:X3AGGACIX3CTCCAa3<XATGCrC(^^ 

5 ATGTTTGGGATTGGATATGCAOKnXSlTGACTOATTrCAAGACCK^^ 
CAAGCr(Xn'GCCXKX5ATTGCCGGGAGTCmTIOTCrcATC^ 
AAGGGAGTCTGGCGGGGCGACXKKJATCATGCAAACCACCTGCCCATGTGGAGCA 
CAGATCACCGGACATGTGAAAAAOGQTTCCATGAGGATCGTGGGGCCTAGGACC 
TGTAGTAACAajKKK::ATGQAACATT^^ 

10 GCACGCCCTCXXCGGCXXX:AAATrATrCTAGGGOG<nGTGGCGGa 
GGAGTACGlXKSAGGTrACGCXXKiTGGGGGATITCCACTACXjTC^ 
CA<TOACAACXnAAAGTGCXXXnmtAGGTTCXXKKXXXX)QAAT^^ 
GTGGATGGGGTGaK}TItK:AQ^GGTAaKnXX:AGa3^^ 
AGGAGGTCACATTajrGGTCXKKMnClAATCAAT^ 

15 ATGCGAGCXXX3AA(X!GQACGTAGCAGTGCrcACTKX:AT^^ 

CACATTACGGCGGAGACGGCTAAGCOTAGGCKKSCXJAGGGOATCK^^ 

TQGCCAGCTCATCAGCTAlCXL^GCnGTCTG^ 

ACXX»TCATGACItXXXXKiAC3GCTOA0Cri^ 

AGGAGATGGGOKKSAACATCACXXXXXnXKJAGTCAGAAAATAAGGTAGTAAT^ 
20 TGGACIUITrcGAG(XGCIXXAAGaK5AGGA(^ 

CXKSCGGAGATCCIXKXKSAGGTCCAGGAAATKXXJTCGAGCGATGCC^ 

CACGaXGGATTACAACXXJIXXACTGTTAGAGTCCraGAAC^ 

(XCTa:AGTGGTA(:ACGGGTGTrcAlTGCXXKXJ^ 
* (XrrcCACGGAGOAAGAGGACXSGTimCCKnCAGAA^ 
25 TGGCGGAGCTCGCCACAAAGAanTCCGCAGCIX^ 

(XGAOnTGAGTCGTACTOCTCtlATGCCXCC^^ 
OJATCrcAGCGAaKKjTXnTGGTC^^ 
anxnXKTKKnXXJATGTCXnACACATGGACAGC^^ 
30 GOKJAGGAAACX^AAGCKKXXLVTCAATGCAC^ 
ACAACTTGGTCrATGCrACAACATCIXX^ 

TCAC<mTGACAGACIX3CAGGT(XrKKjACX3A<XACTAOCGGGACXnX^^ 

GATGAAGGCGAAGGCrnXXACAGTrAAGGCTAAACTICTAT^ 

CraTAAGCTOACXKXXXCACAlTOGGC^ 

35 GACX3TCCGGAACCTATCCAGCAAGG0CGTrAACX:ACAT(XGCIXXX?^ 
ACniXKJIX3aAA<MCACK3AaACAOCAATTGAC^ 
AGGTTITCrKKXntXTUOCAGAGAAGGGGGGCXXKIAAGCX^ 
ATKXX:AGAlTIXKKKKnTCX3TGTGTGCGAGAAAATGC^^ 
TCX:AOCCIXXXJrcAGGCX:GTGATGGGCrc^^ 

40 ACAGCGGGTC»AGTICCrGGTCAATtXXnXXJAAAGCGAAGAAAT^^ 
CITOK^ATATGACVV^CXXXKnxnTITGACnC 
GTTOAGGAGTCAATCTAaXAATGTIXrroAC^^ 
TAAGGTOKnCACAGAGCGGCTrrACATCXSGGGGCXXXXJroACT 
GCAGAACTGOKKn'ATCGCXXKntKXXK^GCXSAGa^^ 

45 CGGTAATA(XCrcACATGTrACm}AAGGCCGCIXK:XK3^ 

CltXAGGACKKIACCATGCTCGTATGaKSAGAaSACCTK^^ 

gokdggggaco^aagaggacsgaggcgagcoacxkkkxnicac^^ 
ctagatacixngccxxxxxnxsgggacocgcxxjaaacxjagaatac^^ 
gataacatcatgctoctcx:aatgtgtcagtcxkxk:acgatgcatc^^ 
50 GTCTACTATcnx:Acccxnx5Acxxx:Aa:iAcx:^^ 

CAGCTAGACACACltXiiAGTCyU^TI^^ 

CACXnTGTGGGCAAGGATGATCXnXSATGACTCATlTCr^^ 

AGGAACAACITGAAAAAGOCOAGATTGTCAGATCrrACXKKSGCCT^ 



I 



wo 01^364 PCT/USOl/16822 



87 

TGAGCCACTraA<XTACCTCAGATCATTCAAOGA(ntX:AT^^ 
CACKX^ATAGTTACIUrcCAGGTGAGATCAATAGGGlXKKri^ 
ACTIXKSGGTAaXKXXnTOCGAGTCTGGAGACATCXKKK^ 
AGGCTACIX3TC<XAGGQGGQGAGGGCrGCCACITGTGGCAAGTAC^^ 
5 GGGCAGTAAGGACCAAGCTCAAACTCACnxXAATCCCXKKn'GCGTCCXJAGTl^ 
TITATOCAGCnGGmOTIGCTGGTTACAGCGGGGGAGACATAT^^ 
TClXX?roCCXX3AaXX:GCTGGTrcATGTGGTG^ 
AGGCATCrrATCTACnXXCX:AACCX3ATGAACXKK3GAGCT 

AGGCx :AT CciGTriTiTix x x;iirinmTio ^^^ 

GCCCTAGTCAaKKn'AG<nxnGAAAGGTCX:GTGAGCCQC^^ 
(nXJATACTXKKXntnCTGCAGATCAAGr 



IS SEQIDN0:14 : Amino add sequowe of the NSSA protein of HCV adaptive replicon I, 
wbeie amino acid geoerated is Ugjiligjited in bold 

SGSWIJU5VWDWICna.TDPKTWLQSKIlJPRIJGVPFFSCXJ^ 
OPCGAQrreHVKNGSEMRTVGPRTCShriWGT^ 
20 AAEEYVEVTRVGDFHYVTGMnDNVKCPCQWAPEFFrEVDGVRI^ 
REEVTFLV<aJNQYLVOSQIK:EPEroVAVLTSMLTDPSHrrAETAKR^ 
SSSASQLYSFEPIX3AEEDEREVSWAEIIJUlSRKEPRAMPIWARPDY^ffPIJ^WKDP 
DYWPVVHGOPUPPAKAPPIPPPRRKRTVVLSESWSSAIJ^ELATOT 
TATASPE)QPSDDQDAGSDVESYSSMPPLEGEP(HDPDI5DGSWSTVSEEASEDVVCX: 

25 

SEQlDNOilS : Amino acid sequence of tiie polyprotein coding.t^on of HCV adq>tivB 
replicon VI, wboe amino acid changes are highlighted in bold 

30 MAPITAYSQQTRGLIX3aiTSLTGIU)RNQVEGEVQVVSTATQSFIATC^ 
HGAGSKTIJV(n>KGPnQl>mNVDQDLVGWRAPPGARSLTPCnrC^ 
VIPVRia<HmGm-SPRPVSYLKGSSG(MIX:PSGHAVGIFRAAVCTRGVAK^^ 
FVESMETmRSPVFTDNSSPPAVPQTnFQVAHIJEIAFIGStaCSl^^ 
YLNPSVAATLOTGAYlimiAHGroPNlRTGVRTITroAPr^ 

35 DmCI>ECm'll)STllLGIGTVII)QABrAGAia:A^VIATAIPP 
GEIPFyGaCAIPIEiTKGGRHLIFaiSKKKa)ELAAK^ 
GDVIVVATDAmTCaTGDFDSVIDCmCVTQIAn^ 
RGRTGRGRMGr««fVTPGERPSGMFDSSVIX::ECYDAGCAWYELTPAE^ 
NTPGLFVajDHIJEFWESVFTGLTHroAHFIiKiTKQAGDl^ 

40 PPPSWIX5MWKCIJRIja»TLHGPTPIiYRmAVQNEVTriHPri^^ 
TSTWVLVGGVLAAIJWVYCXTTGSVVWCSRIBLSGKPAIIPDRE^ 
IPYIEQGMQLAEQFKQKMGIIXJTATKQAEAAAPVVESKWRILEAFWAKHM^^ 
GIQYIAGLSTLPGNPAIASLMAFTASnm-TlQHTLIJFNI^^ 
VGAGIAGAAVGSIGLGKVLVDILAGYGAGVAGALVAITK^^GEMPSTEDLVNIiPA 

45 ILSPGALVVGVVCAAIIJRiaiVGPGEGAVQmiNmAFASRGNHVSF 
ARVlX3ILSSLTIIQlJjmHQWINEDC3SlPGS^^ 
IXPRIPGVPFFSCQRGYKGVWRGD(HMQTIX3»CGAQITGHVKNGSN^ 
WHGTFPmAYITGPCIPSPAPNYSRALWRVAAEEYVEVTRVGDEBnfVTGMnD^ 
CPCX2VPAPEFFIEVDGVRLmYAPACXPIJJlEEVlia.VGUiQYLVGSQI^^ 

50 AVLTSMLTDPSHrTAETAKRRI^GSPPSIASSSAIQI^APSOCAlXnTRH^ 
EANIXWRQEMGGNrmVESENKVVILDSFEPIXiAEEDmUSVSVPAEn^^ 
PIWARPDY1WLIJESWKDPDYVPPVVHGCTU»PAKAPPIPPPRRKRT^^ 
AELATKTFGSSESSAVDSGTATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSD 
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GSWSTVSBEASEDWCCSMmWTOAIJTPCAAEBlKLPINAI^NSII^^ 
TSRSASIJlQKKVTFDRIjQVII)DHYRDVIJaEMKAKASTVK^ 
ARSKFGYGAKDVRNI^SELAVNHIRSVWHJIIJBDlElProT^^ 
Ria»ARIJVITOIX3VRVCEKMALYDVVSlIJ»QAW 
5 AKKCPMGFAYDTRCFDSTVTEbromVEESIYQCXPIJVPEARQAIRSLl^ 
NSKGQNCGYIUlCRASGVLTTSa5im.TCYIJKAAAAaUSAKLQI^ 
lOSSAGTQEDEASLRAFTEAAnRYSAITGDPPKPEYDimTSCSSNVSVAHDA^^ 
VYYLTRDPTTPIj\RAAWETAiariTWSWIX3NIIMYAIT^ 

QLEBCAIJXX5IYGACTSIEPIJ)IPQnQRmGI^AFSI^YSPGEIimVAS<XRKIX3^ 
10 RVWWIRARSVRAMI5QGGRAATCGJCYIJNWAWTKLKLTPIP 
GYSGGDIYHSLSRARPRWFMWCLLLLSVGVGiyLLPNR 



iSEQIDNO:16 : Amino acid sequence of the NSS A protein of HCV adq>tive iqjlicon VII, 
IS ^ere amino add change is bigUigbted in bold 

SGSWIJlDVWDWICnVLTDEiaWIjQSKIIJMPGVPFFSCX^^ 
CPCXSAQm^VKNGSMWVGPRTCSOTWHGTFKNAYr^^ 
AAEEYVE\rrRVGDEHYVTGMTTDNVKCTCQVPAPEFFIEVIXj 
20 REEVTFLVGmQYLVGSQIKiEPEPDVAVLTSMLlDPSHrrAErAKRMARGSPPS^ 
SSSAIQLSAPSUCATCrnmDSFDADUEANIXWRQEMGGNim^ 
IXJAEEDEREVSWAEIIJtRSRKFPRAMPIWAW»DYNPPII£SWKDP 
IJ»PAKAPPIPPPI«KRTVVI5ESTVSSAIJVELATKIPGSSESSAVDSGTATASP^^ 
DGDAGSDVESYSSMFPLEGEPC9>PDLSDGSWSTVSEEASEDWCX: 

25 

SEQIDNO:!? : Amino acid sequoice of tbe polypntein of HCV adaptive icplicon U, ^Axae 
amino acid dianges are bighUf^ited in bold 

30 MAPITAYSQQTOGIIXjCmLTGRDRNQVEGEVQVVSTATQSELATCVNGVCW^ 
HGAGSKTLA<a»KGPriQ»mNVI)QDLVGWQAPPGARSLlPCiajSSDLYLVlIU^ 
VIPVRIUl(a)SRGSLLSPRPVSYIXGSSGGPIIX3»SGHAVGIFRAAVCTR^ 
PVESMEmtRSPVFTONSSPPAVPQIPQVAHLHAPTGSGKSTKVPAAYAAQC^ 
VllJPSVAATLGFGAYMSKAHGroPNniTGVRTrrTG^ 

35 DinCDECHSlDSTTILGIGTVUXJAETAGARLVVIATATPPGSVTVPB^^ 
GEIPFYCTlAIPimTKGGRHIJFCHSEiaKCDELAAKl^ 
GDVrVVATDAIMTCTTGDEDSVIDCKrcVTQT^a^FS^ 

RGRTGRGmGIYRFVTPGERPSGMFDSSVIX:ECyDAGCAWYELTPAETSVRLRAYL 
NTPGIJVCJQDHLEFWESVFTGLTBaaJAHFIJSQ^ 
40 PPPSWDQMWECmiJOTm(aTPIJ:.YRIX5AVQNEVmmiKy^ 
TSTWVLVGGVIJ\ALAAYCX,TTGSVVIVGIUII5GKPAIIPDREVL^^ 
IPYIEQGMQLAEQEKQKMGIljQTATKQAEAAAPVVESKmTLEAFWAKHM^^ 
mQYLACSJSTLFGNPAIASlIdAFTASnSPLTIK^^ 

VGAGIAGAAVGSIGIXaCVLVDIIAGYGAGVAGALVAFKVl^GEMPSTEDLVNLLPA 
45 ILSPGALVVGVVCAAILRRHVGPGEGAVQWMNRLIAFASRG^IHVSPTHYVPESDAA 
ARVTQn.SGa.TITQIlJmHQWIhnBDCSlFCSGSWLI^ 

LLPRXJGVPFFSCQRGYKGVWRGDGMQTra^DGAQIIXfflVKNGSMiaVGPRT^^ 
WHGTFPINAYTIX3PCTPSPAPNYSRALWRVAAEEYVEVTRVGDFHYVTGMTro^^ 
CTOJVPAPEFFTEVIXJVRimYAPACXPmffiEVTFLVGmQYLVGSQIK:^ 
50 AVLISl^TDPSHITAETAKRGIAJiGSPPSlJVSSSASQLSAPSIJK^^ 

EANLLWRQEMGGNITRVESEtQCSn^IU>SFEPLQAEEDEREVSVPAEI^ 

PIWARPDYNPPIlESWKDPDYVPPVVHGCPIPPAKAPPIPPPRiaCRTVVl^ES^ 

AELATKTFGSSESSAVDSQTATASPDQPSDDGDAGSDVESYSSMPPLEGEPGDPDLSD 
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GSWSTVSEEASEDWCCSMSrnWTGAlJTPCAAEETBaPMALSN 

TSRSASLRQKKVTFDM-QVlI)DHYIlDVUaBMKAKAST\^^ 

ARSKFGYOAKDVR^IIiSKAV^IHIRSVWIa:)IIMrI^ 

RBTARIJVITDIXJVRVCmMALYDWSTLPQAVMGSSYGFQYSPGQRVEFLVNAWK 

5 AKKCPMGFAYDTROTDSTVTEMJIRVEESIYQaS)!^ 
>reKGQNCX3YI»CRASGVLlTS(XamTCYIJCAAAACa^^ 
ICESAGTQEDEASniAFmAI^YSAPPGDPPKPEYDLEIJTSCSSNVSVAH^ 
VYYLTRDPTTPIARAAWETARHTPVNSmX3^^IMYAFTLW 
QIJEKAIJDCQIYGACTSIEPIJ)IPQnQRIJHGI^AFSmSYSPGEINRV^ 

10 RVWRHRARSVRARII^QGCaiAATCGKYLFNWAW^ 
GYSGGDIYHSLSRARPRWNCWCIIXLSV6VGIYLLPNR 



SEQ ID NO:18 : Amino add sequence of flicNSSA protein of HCV adq)tiveiq>liconI]^ 
IS ^idiere amino add diange is UghUgbied in bold 

SGSWLMDWDWICTVLTDFKTWIXJSKIIPRIJPGVIWSC^ 
CPCXjAQITGHVKNGSiMRIVGPRTCSNTWHGTFPmAYT^ 
AAEEYVEVTRVGDFHYVlXMITDNVKCKXiWAPmTTEVDGVRU^^ 
20 REBVTFLVGIJIQYLVGSQIJCEPEPDVAVLTSJMLTDPS 

SSSASQI^APSLKATCTTEUEDDSPDADUEAtOXWRQEMGCTOTRVESEN^^ 
PIXJAEEDEREVSVPAEEJtRSRKPPRAMPIWARPDYOTPIXESWKDPDYVPPVVHGCP 
lPPAKAPPIPPPRWaRT\m.SESTN^SAIJ\ELATK^ 
1XJDAGSDVESYSSMPPIJSGEP(H)PDLSDGSWSTVSEEASEDVVCX: 

25 

SEQIDNO:19 : Aimno add sequence of fte NSSA protein of HCV adq>tive i:q>licon V, 
M^iere amino add.cliange is hig^ghted in bold 

30 SGSWIJU3VWDWICTVLTDEKTWIXJSKIIJ?RIPGWFFSC^ 
CyCGAQrreHVKNGSMRTVGPRTCSNrWHQTFPINAYTTC^ 
AAEEYVEVlRVCaJFHyVTCTkmDNVKCPCXJWAPEFFTEVD 
REEVTFLVGUiQYLVGSQIJrcEPEPDVAVLTSMLTDPSHITAErAEIURI^ 
SSSASQLSAFSUKATCnSHDSFDADUEAra^^ 

35 PLQAEEDEREVSWAEILRRSRFaTRAMPIWARPDYNFPIIJESWKDPDYV^ 
IJPAKAPPIPPPRRKRTVVLSESWSSAIJ^ELATKTFGSSESSAVDSGTATAOT 
DGDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEEASEDWCX: 



40 SEQIDNO:20 : Amino add sequence of &e NS5A protein of HCV adaptive leplicon IV, 
where amino add change is U^iligbted in bold 

SGSWIJmVWDmCIVLTDFKTmQSKIXPRIJ'GVPFFSajRG^ 
CPCGAQimHVKWGSMmCOTlTCSNIWGTFPINAYITCT 

45 AAEBYVBVTOVGDEHYVKMITONVKCPOQVPAPEPFrEVDGVRI^ 

REEVTFLV<H2IQYLVGSQIJPCEPEPDVAVLTSML1DPSHITAETAKRRIj\RGSPPC^ 
SSSASQI^APSIJCATCTTRHDSPDADIJffiAMXWRQEMGGNTm^ 
PIXJAEEDERBVSVPAEnJlRSRKFPRAMPIWARPDYNPPLIJBSWKDPDYVPPV^ 
IJPPAKAPPIPPPRRKRTVVLSESWSSAIJ^ELATK'IFGSSESSAVDSGTATASPDQPSD 

50 IXSDAGSDVESYSSMPPLECffiPGDPDLSDGSWSTVSEEASEDVVCC 
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SEQ ID N0:21 : Ammo acid sequence of Ifae NSSA protein of HCV adq>tive i:q)licon HI, 
latere amino acid change is hig^gjilied in bold 

SGSWLRDVWDWICnVLTDFKTWIXJSKIIJ»Rm3A^^ 
5 CJPOJAQrraHVKNGSMRIVGPRTCSMWHGTFPmAYTlXMnPSPA^ 
AAEEYVEVTOVGDFHYVTGKmT)>rk^aPCQWAPEFFrEV^ 
REEVTFLVGLNQYL VGSQ IKJBPBPDVAVLTSMLTDPSHrrAETAKRK^ 
SSSASQLSAPSLKATCITRETOSPDADLIEAMXWRQEMGGM^ 
PLQAEEDEREVSWAEnjUlSRKEPRAMPIWAm)YNPPII£SWKDPDYWPVVHGCaP 
10 IJ»PAKAPPIPPPRRE31TVVIJSESWSSAIJ^EIATKITOSSESSA^ 
XXjDAGSDVESYSSMPPLEGEPGDPDLSDGSWSTVSEEASEDVVOC 

SEOIDNO:22: Nucleotide seqpience of DNA ckme of HCV additive lepliocm HCViep/NS2- 
5B (see Figure 9) 

15 

Ga:AGCCCm3ATTGGGGGa3ACACrcx:A(XJATAGATX^ 
TACimUTTCACGCAGAAAGCXnCrAGCXZATQGOGTrAGT^ 
COXXIAGGAOOCmxnmxmAGAGOCATAQTGGT^^ 
CAaXKjAATTG<XSAGGAajA(X»GGTCCTrrcn^ 
20 GGAGATITXKKKXnXKXXXXXKXJAGACnXKnAGaXSAGTAGTGTK^^ 
AAGGCOTGTGGTACl<KXriX5ATAGGGTGCrnKX5A<^^ 
AGAaX3TGCA<XAGACX:ACAACX3GTTTCCXnCT 

Tcantx(xcmxcrAACx3TrACix3Goc^^ 

GCXmiXncrATATGTrATITTCX:AC3CATAT^^ 

25 (XXSGAAACCTGGCCOmUITCTTOACXjAGCAT^^ 

CCAAAGGAATGCAAGGTCIXmXjAATGTCmGAAGGAAG^ 
CnTCTTOAAGACAAACAACGTCTOTAGOGAaxnTI^^ 
ACCroGCGACAGQTGCCItnXKXKKX:AAAAG<XAOGT 
AAAGGa3GCACAAOOCCAGTGCX:ACGTIXnX3AGTIXK5ATAGTTG^ 

30 CAAATGGCrCTCOtlAAGCGTATrcAAC^ 

A(XXX:ATK3TATGGGATCTOATCTCGGGCOtXK^^ 
AGTCXjAGGTTAAAAAAOGTCTAGGCmXXXjAACXi^ACXKS^ 
TGAAAAACACGATAATA(X:ATGGACCGGGAGATGGCAGCATCX3TGCGGAGGC^ 
GGTITrCGTAGGTCKMTACrciTQACXn^^ 

35 CTAGGCICATATGGTQGTTACAATATITTATCAO^ 

AGTGTGGATCmXCCXnt^AACXmXXKSGGGGGCC^^ 
ACGTGCGCGATCCAOCCAGAGCrAATCTrTACX::ATCAC^ 
TACTCGGTOCACTCATGGTGCnXAGGCTOGTATAACXAAAG^ 
GCGCXKIACACGGGCTCATTCXJKKATGC^ 

40 CATTATGTa:AAATGGCIUrcATGAAGTKKKXXK:AC^ 

ATCACCATCTCA<XOCA(nXKX3GGAClXK3GCXXACGCGGGC^ 
GGTGGCAGTrGAGCOOGTCGTCTIXnuroATATGGAGA(X:AA(^^ 
GGGGCAGACACXXjCGGOGTGTGGGGACATCATCTTGGGOC^^ 
GCAGGGGGAGGGAGATACATCriXKXSACX:GGCA.GACAGCXnTGAAGGGC^ 

45 TGGa3ACrCCrcGaKX:TATTACGGCCTA(^^ 

GCrGCATCATCACrAGCXm::ACAGGCX3GGGACAGGAAC^ 
TCCAAGTGGTCIXXACXXSCAACACAATCri^^ 
GTGTKKSACTGTCTATCATGGTGCXXKKnCAA^ 
0CAATCACXXJAAATGTACA<X:AATOTGGA0CAGOACXnCX^^ 

50 C(XXXX:GGGGCXKXnTC<nTCACACX:ATGCACO<KX^ 
TGGTCACX3AGGCATGCCGATGTCATItX:GGTGaKXXKK: 
GGAGCCTACrciXXXXX;AGG<XCGTCTCCTACI^ 
ACTGCIOGCCXXntXKKjGCACCCn^^ 
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<X3AGG<3(nTGC»AAGGCXK}TGGACnTKnACm3TC!GAGTCT 

TGCGGlXXOCGGlCITCACGGACAACimTC(XCT^^ 

CCAGGTGG<XCATCTACACXKXXX7rACKKn'AGCGGCAA 

GGCrGOTATGCAGCXX^AAGGCO'ATAAGGTGCTrQT^^ 
5 G<XACanAGGTITCXKK3GCGTATATGTCTAAGGCACATGGTATC»AC^^ 

TCAGAACCGGQGTAAGGACCATCA(XACGGGTXKXCCCATCACGTACr^ 

T(KK:AAGTTTCITGCCGACXKm3GTTGCr<^^ 

TGTGATGAGTGCCACTCAACTGACTCGArcACrATCCrGGGCAT^^ 

TGGACC:AAGCGGAGAaK]KrrGGAGCGCGACKX5TCGTGC^ 
10 CimKKjATCGGTCACCGTGCCACATCCAAACATCGAGGAGGTGGCI^^ 

CACrGGAGAAATCCX:CTITrATGGCAAAGOCAiaXX:ATC^ 

GGGAGGCACXTCATTITCIXKX^ATltXIAAGAAGAAATGTGATQAGC^^ 

AGCroTCCXSGCX^TCGGACTCA/^TGCnxn'AGCATA^^ 

CmtlATACCAACrAGOKSAGACXnCATTGTCGTAGC^ 
15 GGCITTACXXKKXSATITCXSACrCAGTGATOGACIXKIAATACA 

CAGTCGACITCAG<XriXK3ACXXX3ACXnTCAa:^^ 

AAGACXKXiQTGTCACXKnXXK:AGCX3G(XSAGGCAGGACnXK^ 

GCATITACAGGTITCTGACTOCAGGAGAACGGaXnXXKK^ 

GGTTCTCTGCX5AGTGCrATCACGCXK3GCIGTGCTrGGTAC^ 
20 GAGACXJrCAQTTAGGTraCGGGCrrACO'AAACACArcAGGGTI^^ 

AGGAOCATCroGAGTrcKMSGAGAGOGTCrrrrACA 

(XATTTCTItnXXCAGAC7rAAGCAGG(l\GGAGAC^C^^ 

TACCAGGCTACGGTGTGaKX:AGGG(nCAGGCrcCACCTCCAT^^ 

TGTGGAAGTGTCrcATACGGCrAAAGC(:TACGCrcCAaKKK;CAA(^^ 
25 GTATAGGCTGGGAGCXXJlTCAAAACGAGGTTACTACX:ACACACm:AT>^ 

ATACATCATGGCATGCATGTCXKKnGA(XlGGAGGTCGTCACGAGCA<XTGG^ 

CIGGTAGGOKJAGTCCTAGCAGCIOXXKXGC^ 

TGGTCATTCTGGGCAGGATCATCTKnX^ 

GGAAGIXXTITAOCXXKJAGTTCJGATaAGATGGAAGAGTGCXS^^ 
30 TACATCGAACAGGGAATGCAGCIXXKXX5AACAATTCAAACAGAAGGC^^ 

TTGCnGCAAACAGCX:A(Xl\AGCAAGCGGAGGCTCKnG^^ 

AAGTXKXXKSAOOCTOGAAGOCTTCtGGGCGAAGC^ 

GGATACAATATTTAGCAGGCITGlXXIACnCTXKXnGGCAACCC^^ 

ACIGATGGCATTCACAGCXnurATCACCAGCX:CGCrCA^ 
35 CIX3TrrAACATCCTGGGGGGATGGGTGGCCGCXX:AACTrGCT^^ 

GCJTCTGCITKXn-AGGCGaXKKllAT^^ 

iTGGGAAGGTGCTIGTGGATATTnXSGCAGGTrATGGAGCAGGGGTG^ 
. GCrrCGTGGanTrAAGGTCATGAGCGG<X5AGATGaXnXXA(^ 
AA<XTACnXX:TGCTATCXnCimxriGGO^^ 

40 AGC!GATACTCOGTCGGCA(XlTGGGCXX:AGGGGAGGGGGCKnXK^ 
CaKKnGATAGOGTTCGCTIXXaKKXn'AACCAC^^ 
C(nGAGAGa3ACX3CKK:AGCACmxnx:ACTCAGATCC^^ 
CTCAGCrGCTGAAGAGGCrrCACCAGTGGAT^ 
CIXXGGCTO3TGGCrAAGAGATGTITGGGATIXK3ATATG^ 

45 TTCAAGACXnGGCIXXAGTCCAAGCKXriGi^ 

CTCATGTCAACmGGGTACAAGGQAGTCrGGCXKKKKXJAC^^ 
CACCrGCCX:ATGTCGAGCACAGATCAOCX3GACATGTGAAAAACX3GT^ 
GATCGlXKKSGCXnAGGACCTXnAGTAACACGTGGCATGGA^ 
GaSTACACCAOSKKKXXXTGCACXKXXJrCCCa^ 

50 TGTGGCGGGTGGCTCCroAGGAGTACGTGGAGGTrACXSaKKnX^^ 
ACTACX3TGA(XKKKIATGACX:ACTXjACAAdGTAAAG^^ 
GGa:CCmAATItjrTCACAGAAGTGGATGGGGTGCXKm'(^ 
GCGTGCAAACXXXJKXn-ACXjGGAGGAGGTCACATrccnXKj 
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ACX:iGGm3QGTCACAQCrCCX:AT^^ 

TTCCATCKTCA<XGACC(XT(XCACATTACX3GCGGAGACXKK7r^ 
GCCAGGGGATCICCCXXCrCCITGGOCAGCrcATCA 
CnrCOTGAAGGCAACATGCACTAaXXyrcATGACTC^^ 
5 (X3AGG<XAACCrccrroTGGCGGCAGGAGATGGGaKK}AACATCAC<XX3^^ 
GTCAGAAAATAAGGTAGTAATTITGGACTCTITOGAGCTOCTC^ 
GATGAGAGGGAAGTATCCGTrcaXKXSGAGATCCTOa^ 

<xtcgagcgatgcxx;atatgggcaogcccggattacaacccrcx:aot 
<xtggaaggacxx:ggactacgtcxxtccagtggtacaogggtgtccatix^^^ 
10 tga:aaggcccx7ixxgatacca(xtccaak3aggaagaggacggtt^ 
gaatcracxxjtgtctitjkkxtn'ggcggagcixxkjcacaaag^ 
cogaatcgtcxkkxxncgacagoggcacggcaa<xkkxt^^ 

a}A(X3ACGGaiACGCXKKjATCXX}ACGTroAGTCm'ACTC^^ 

GAGGGGGAGCOGGGGGATCXXGATCICAGGGAOKKnxnTGGlCT 
15 GAGGAGGCTAGTGAGGAOnX^GTCTGCTCCTCXJATGTC^ 

GCXXrroATCAOG<XATGCGCTCOGGAGGAAAOCAAGCnX3CXX^ 

AGCAACTCmXKTKXXnCAOCACAACriX^ 

CAAGCXnXKXKKIAGAAGAAGGTCAOCTTraACAGACKKIA 

ACTA(XGGGACGTGCrcAAGGAGATGAAGG(mAGGCX3TCCACAGTrAAGGCT 
20 AACITCnATOOGTGGAGGAAGCCIOTAAGCIXiACGC^^ 

TAAATTKKKTATGGQGCAAAGGACXmXXKSAACJCTATOCAGC^ 

<XACAlXXGCIXXGTGTGGAAGGACTKKnXK5AAGACACTOAGACA0CAAl^ 

Aa:A<X;ATCATGGCAAAAAATGAQGTITICIt3^^ 

OQCAAGC3CAGCrcX3CX:rn:ATCX3TATrcCX:AGATT^^ 
25 AAATGGCCCITrACGATGTGGTCIXX:AaxnXXCrcA 

TACGGATTCCAATACTUIXXnXKJACAGCGGGTCGAGTT^^ 

AAGCGAAGAAATGCCCrATGGGCTrcGCATATCACACCX:»CTGTI^ 

GGTCACraAGAATCACATCXXnGTTCAGGAGTCAAT^ 

GCCmXJAAGCXJAQACAGGOCATAAGGTOGCrcACAGAGiXKK:^ 
30 GGCC(XCroACrAATICTAAAGGGCAGAACIX3aKKrrATCGrc 

GCGGTGTACmAC»ACCAG<nGCXKnAATACCXnx:ACATGTrACTro 

TGOGGCXZriXnCXSAGCrGCXSAAGCIXXAGGAC^^ 

GACCTKnXXJITATCnxm3AAAG<XK:GGGGAax;AAGAGGAC^^ 

CXKKKXJITCACXKjAGGCTATGACTAGATACTC^^ 
35 AACCAGAATACGACirGGAGTTOATAACATCATGCTOCKX^AATQT^^ 

GCACGATGCATCn<KK;AAAAGGGTGTACrATCrrCA0(XX3TGACC^^ 

CiraaXXKKKJKKXmKKJAGACAGCTAGACAC^ 

GCAACATCATCATGTATGCeCXXACXjrnjKKS^^ 

TITCTIUrcCATCX:TKnAGCTCAGGAACAAC^ 
40 TCTACXKSGGOCTOTrACTOCATTGAGOCAC^ 

ACKX^ATGGCCTTAGCGCATriTCACTCCATAGTTACTCTC^ 

GGGTCGCITCATGCXnCAGGAAACTrGGGGTACX3GCXXnTGC^ 

TOGGGOIlAQAAGTGTOtXSCGCrAGGCTACKnC^ 

TKJrGGCAAGTA<XTCTrcAACTGGGCAGTAAGGACX:AAGC^ 
45 ATCCCGGCriXKX3TCCX:AGTKKmTrAT^ 

GGQAGACATATATCACAGCOtnCTCXnGCmSACXXXXXnXj^^ 

CTACrcCTAClTltTItn'AGGGGTAGGCATCTATCTACTC 

GGACX^TAAACACnXXAGGOCAATAGGO^ATCCimTriTITC^^ 

1111 1 1 1 1 1 11 1 1 11 11 111 1 111 ilTni ITl CIC Cll 1 1 iiTl i €CT LimTn CC^ 
50 TTCTITanTrGGTGGCia:ATCTrAGCXD^^ 

GTGAGOXKTraACnKIAGAGAGTGCTGATACTCGOC^^ 
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SEQIDNO:23: Nucleotide sequence ofMJengfliHCVcDNA clone cimtainingflje 
mutation Omt results in Ser to He at position 11 79 of SEQ ID N0:3, and wbere tbe 5' NIR is 
fiised to flie neooiycin phosphotransferase gene and flie EMCV IRES is inserted upstream of 
Ae HCV open leading fiame (see Figure 9) 

5 

CKXIAGCXXXXXSATIXKKKKKXJACACrcCAaiATAGATCAC^^ 
TACraTCriTCA(:XK:AGAAAGCGTCTAGCCATGGCmTAGTATGA 
(XnXX:AGGA<X(XXXCnxaKK}AGAGCX:ATAGTGGTCTGC 
C^aXKjAATTGCCAGGACGACXXKSQTXXnTrCTTGGATC 

10 GGAGATriX3GGCGTGCCC(XXKX3AGACrGCrAG<XX}AGTAGTGT^^ 
AAGGCCTTGTGGTACTGCCIGATAGGGTGCnrrGCGAGTGC^^ 
AGAOOGTGCACX:AT6AGCAOGAAT0CTAAAGCnC!AAAGAAAAACX:^^ 
GCCATGATIX3AACAAGATGGATIGCA(»CAGQTKnxXGGCXXKTIGG<^^ 
GGCTATTCXKKTATGACnXKKK:ACAACAGACAATC«GC^^ 

15 GTnXGGCIXnCAGCXKJAGGGGCGCCCGGTTCTITrrG^ 

GGTGCCCTCAATGAACTGCAGGACXiAGGCAGCGajGCTATC^ 
ACXKKXXm^CrrGCGCAGCrcTGCrCGACGTrQT^ 

ggcixxtatixkkkx5aagtgccggggcaggatctcx:tgt^ 
tgccgagaaagtatcxiatcatggctgatgcaatgcxkkx^^ 

20 CCGGCrACCrGCXX:ATrcXMCCACX::AAGCX3AAACATC^ 

ACTOKjATGGAAGCX^GGTCTKnCGATCAGGATGATCT^ 

GGGCrCQaKX:AGaX5AACIGTrCG(X:AGGCI^^ 

GAGGATCTCGTOGTGACXX:ATGGCGATGCCTGCTrGC^^ 

ATGGCCGCITrTCrrGGATrCATC»ACrGrGG 
25 TCAGGACATAGanTGGCrAOXXmjATATTGCTQAAGAGCnXKKXK^^ 

GCrGACXXKnTXXriCGTC<nTrA(XKnAT^^ 

(HTCTATCGCXJmnTGACXSAGTrCTrcrcAGl^ 

CTCTAG<XK3GATCAATTCX:X3aXX:rC^^ 

AAGCXXKnTGGAATAAGGCCXKnGTXKXJTITCTCTATATC 
30 TGCXXJTCTTTIGGCAATGTGAGGGCXXXKSAAACOGGCCC^^ 

CAlTCXTTAGGGGTCnTrCXXJCTCTCGCX^AAA 

GTGAAGGAAGCAGTItXTCrrGGAAGCTrCTrcAAGACA^ 

ACC0TIXK:AGGCAGCX3GAACXXXXX^CX7IX3GCGACAGGTGC^^ 

AGCX:AanX}TATAAOATACACXnYX:!AAAGGCXKK:ACAA 
35 TGAGTroGATAGTKJTGGAAAGAGTCAAATGGCICrCCnrAAGCGTAT^^ 

GGGGCTGAAGGATGCCX:AGAAGGTACCCCATroTATGGGATCTGAT^^ 

CJGGTCCACATGCrrrACyiTGTGTITAGT^ 

GAACCACXKKK}ACGTGGTrria:TrreAAAAACAOGATAATAATCAGCAC» 
CCTAAA<XrrCAAAGAAAAA(XAAACXn'AACA(XAA<XGOaKXX:AC^ 

40 AAGTTC(XXKK3CXK3IXKm:AQATC!GTOGGTGGAG 

GCXXX^AGGTKKSGTGTGCXKXKXiACrAGGAAGACTrCOGAG 
GTGGAAGGCGACAACOAlOXXilAAGGCTCXMXJAQCmSA 
CTCAGOXGGGTACXXXnGGCXXXrrCTATGGCAATCAGGGC^^ 
ATGGinXXJTGTCACOCXXnXKKncrOGGCCrAGTIGGGe^^ 

45 CX3TAGGTCGCXK:AATrreGGTAAGGT€ATOGATAC«7rc 
ATCTCATGGGGTACATiaXHJimiaKKXK^^ 
CXnXKKXKATGGCX nCCGGG TICTGG AGGACG G CXj^ 
IXnGCCaKSTKMntXHTTTCTAT^^ 
(XCAGCITOaKnTATGAAGTGCXKL\ACGTAT^^ 

50 GACTGCTCXIIAAOGCAAGCATTCTGTATGAGGCAGC^ 
(XXKKJTGCGTGaxrKKXnrOKKJAGAACAAC^^ 

CACixxK::AcwcrixxKXKicx:^^ 

CATGTCXJATTKKnCXnTGGGGCXKHnGClU^ 
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ATcrcnxKmATCixmTTCxnancxKxx^A 

CACGAGACAGTACAGGACKKJAATTGCrcAATATATCCOSGCCACGTGACAGGTC 
ACCXn-ATGCKnTGGGATATCATCATGAACroGTCACCTACAGC^ 
ATCXKIAGlTACrOOGGATCaiACAAGCTCTC^^ 
5 TGGGGAGTCCTAG<XKKJCCrmK:CTACTATTCCATGGTGGGGAA 
TrCTCATIXn"GATGCTACnXJITKKXXK^ 
GGGGAaJATGGOCAAAAACAaxnXXKKJATTACXjTaXJrCm 
T(XX:AGAAAATCCAGCrTGTAAACA<XAAa3GCAGCTGGCACATC 
GCCXJTGAACJrGCAATGACTCXCrcAACT^C^^ 

10 GCACAAGTTCAACnCATCKKSAKKXX^AGAGCX^ 

GAOKXnTCXKnXJAGGGGTGGGGGiXCATCACnTACAATGAGTC 
ACCAGAGGCCITATIGTrGGCACrAOGCACXXXXKKXX3TGCXK5TATC^ 
GGCGCAGGTGTGTGGTCX:AGTGTACIXKriTCACXXX:AA(^^ 
ACGACXXjACXXKnTCGGanXXXJTAan'ACAGTrGGGGC^ 

15 GTG<nXKnTCriTAACAACACXKXKKXXKXXX:AAGGC 

GGATGAATAGCACriXKKnTCA<XAAGACmX]KDGGGGGCXX^ 
GGGGGATCKKK^AATAAAAOCTTOACXTCaXXIA^ 
CmAGG<XACTrACAa:^GTGTGGTroGGGGOCTIG^^ 
GGTCCACTAaXATACAGGCnrGGCACTACXXXn^ 

20 TCAAGGTrAGGATGTA<XriXKKKK3GAGlXM3AGC^ 

ATrGQACTCGAGGAGAGCGTrQTAAaTOGAGGACAGGGACAGATCAGAQCTTA 
GiXCGCIGCTGCTCICTACAACGGAGTCGCAGGT^ 

anAaxKKJixjiGTCx:AcreGTriGAT^ 

AATAanOTACXSGTATAGGGTCJGGOGGTIXnurcCTI^^ 
25 TGTCCKyrrGClXnTCOT^ 

TGATGCTGCraATAGCTCAAGCroAGGCCGCCCTAGAGAACCKKj^ 

aKXKK:ATCCGTGG<XGGGGCXK:ATGG<:AT^^ 

CIGCXnXKJTACATCAAGGGCAGGCnGGTCCClXKXXK^^ 

CXnATGGCXIJGCTACIXXTOCIXXnGCTt^^ 
30 GACX:GGGAGATGGCAGCATCGTCCXK}AGGCX]KDGGTITI^^ 

TGACXnTGTCAaXK:ACTATAAGCIGTIXXTOGCTA(^^ 

TATrrrATX:ACCAGGG(XXlAGGCAC^CnTGCAAGTO^ 

TTCGGGGGGGCXXK^ATGOMTCATOCTCCrcAC^^ 

AATCTITAO^TCAOCAAAATCTIGCnrCXXXIATAC^^ 
35 AGGCroOTATAAaZAAAQTGCXXn'ACmtOTXKXK^ 

AlXK:ATGClX3GTGaM3AAGGTroCTtKK3^^ 

AAGTTGGCXXk:ACroACAGGTACGTACXmTAiX}ACX:ATC^ 

ACIGGGCCXAaKXKKKXn'ACXJAGACXJITGC^ 

CICroATATGGAGA(X:AAGGTrATCACOGGGGGGCAQACAC^^ 
40 GGACATCATCTIXK3GOCTtKX3CGTCroOGOC^^ 

GGAOXKKIAGACAGCCTrGAAGGGCAGGGGTGGCGACIXXT^^ 

GCCTACnxXCAACAGACXKXSWlGGCCrACTKK^^^ 

GaXKK3ACAGGAACX:AGGTa5AGGGGOAGGTa:AAGTGGTCT^ 

AATCTITCO'GGajACXrraCXnCAATGGCGl^^ 
45 GGCTCAAAGACXXnTCOXSGaXAAAGGGCXXIAATCAC^^ 

GTGGACCAGGACCTCGTOGGCTGGCAAGCGCCCCCXX^^ 

CATGCACXnXKXKKAGCnXXKSACXriTrAClTGGTC 

TCCGGTGCXKXGGCCGGGOGACAGCAGGGGGAGCCTACn^ 

CIXXTA(nTGAAGGGCIOTCGGGCXjGTCX:AC^^ 
50 GTGGGCATCTmXKKKnGCmreTGCACXXX5AG(^^ 

TIGTACXXXnX3GAGTCrATGGAAACX:ACrATG0GGTa^^ 

crcGTaxxnxxx3GcxxnAa:xK::AGAc:AT^ 

ACKKJTAGCGGCAAGAGCACTAAGGTGiXGGCTGCXirATGCAGCCCAAGGGT^^ 
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AAGGTGCITGTCOX5AAaXX3TCmiaXXX3^ 
TCTCTAAGGCACATGGTATCGACCCTAACATCAGAACrGG^^ 
CX:ACGGGTG(XC<XATCACQTACIXXA(XTATQG^ 
TTGCTCTIXKXKKKXKXTATGACATCATAATATGTOATOAC^^ 
5 TCGAa:ACTATCCTXKKK:ATCGGCACAGTCX7KKiACX:AA(^^ 
GOKXSACTCGTCGTGCiaKJCACCGCTACGCXirr^^ 

AT<XAAACATCGAGGAGGTGGCnxnxnx:CAGCACTX3GAGAAATC<XX:riTrrAT^ 
CAAAGCCATaXX:ATajAGACX:ATCAAGGGGGGGAGGCAan'CATITIUrGCCAT 
TCCAAGAAGAAATGTGATGAGCrOJCCGCGAAGCKn'CCGGCCIXXK^ 

10 GCnX5TAGCATATrACCGGGGCCTroATGTATCmrcATA(XAACTAGCGGAGAC^ 
TCATTOTCGTAGCAACXXJAOQCICrAATGACGGGCm 
AGTQATOGACroCAATACATGTGTCACXXAGACAGTroAC^^ 
ACOTCACX:ATroAGACX5ACX3A(XGTG<XACAAGACX3a^^ 
GGCGAGGCAGGACTGGTAGGGGCAGGATGGGCATITACAGGTITQTGACT^^ 

15 GAGAACXKKXXnOKjGCATGTrcXSAlTOCrcGGT^^ 
GGGCTGTGCTIXMn"ACXjAGCnt:A(:XKXX:GCCGAGAC^^ 
TACXnAAACACACCAGGGTrGCCCXnxnX3CX:AGQA(XAlXn^ 
GanxnTTACAGGCJCrrCACXX^ACATAGACXKXXATI^^ 
GGCAGGAGACAACnrOCCCTACCKKSTAGCATACCAGGCTACGGT^^ 

20 GGCTCAGGCIXXACCTCCATCGTGGGACXAAATOIXKjAAQT^^ 
AAGCXTAOKnXKJACGGGCXZAACXKXXXrKKnxn'ATA^^ 
ACGAGGTTACrACCACACACXXXATAACCAAATACATCATGGCATG<l\TC 
TGACCIGGAGGTCXnx::ACX3AGCACCrGGGTGCIGGTAGGaK3AG^ 
CTGGOXKXSTATTGCCTGACAACAGGCAGCmXKnCATT 

25 TGTOCGGAAAGCOGGCCATCATimXJACAGGGAAGTCO^ 

TGAGATGGAAGAGTGOKXrrCACACXnmnTACATCGAACAC^^ 
GCCXjAACAATrCAAACAGAAGGCAATCGGGTTGCrGCAAACAGCX^ACCAAGCAA 
GCGGAGGCTGCKK:riCCOGTGGTGGAATCCAAGTGG 
GGGOGAAGCATATGTGGAATTTCATCAGCXKS^^ 

30 CACTCTGCCTGGCAACXCXGCGATAGCATCACnXJATGGCATn^ 
ACCAGCXXGCnXii^CCACOCAACATAaxnC^ 
TGGOOQCCXIAACTTtKnXXriaXAGCGCn^^ 
GCKjGAGCGGCTGTIGGCAGCATAGGCXJITGGGAAGGTGCT^ 
CAGGTTATGGAGCAGGGGTGGCAGGOGOKrrCXniGGCOTrAAGGT^ 

35 GCXJAGATGCXXntCACXXjAGGACCIGGTTAACXrrACrcC^^ 
GGaXXXTAGT0GTCGGGGT0GTGlXKXK:AGCX3ATAC^^ 
CAGGGGAGGGGGCKnGCAGTGGATQAACXXKKnCATAGCGTr^^ 
GTAA<XACGTCTXXCa:A(XK::ACTATGTGOCIX3AGAGCGAO^ 

CA(nx:AGATanciCTAGTCTrACx:ATCAcrrcA 

.40 GQATCAACGAGGACroClXXAaKXIATGCl^^ 

GGATKSGATATXK^AOKnxnTGACTCATrrcAAGACXrr^^ 

CIXKJOKXfATTGCXXJGGAGTCXXOTCTI^ 

TCKKKXKKKKXSACJGGCATCATQCAAACCACO^ 

(XGGACATQTGAAAAAOGGTTrcATGAGGATCGTGGGGCXnAGGACCT^ 
45 ACACGTGGCATGGAACATKXXX^ATTAACGajTACACCAaKK^ 
CIXXXXXKK:G«L\AATrATTCrrAGGGCXKn-GTGGCXK^ 
GTGGAGGTTACGCGGGTGGGGGATITCCACTACGTGACGGGCATGACCACnXS 
AACOTAAAGltKXXXnGTCAGGTimjGCXXCCGAATI^^ 
GGGTGOKSTrGCACAGGTACXKTrCCAGCGTGCAAACCCC^^ 
50 ICACATTCXnGGTCGGGCrcAATCAATACOGC^^ 
G<X!C3GAA(X3GGAOGTAQCAQTGCICACTTtXATGC^ 
AaKKX3GAGACX]KKnAAGCGTAGGCTGGCCAGGGGATCrrcC^^ 
AGCrcATCAGCTATCX:AGCTX3TCnXK:G(Xr^^ 
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GTCATGACTC<XCXK}ACGCroACCTCATCX3AGGCX:AACCTC(^^ 
GATGGGCGGGAACATCACCCGCGTGGAGTCAGAAAATAAGGTAGTAATTTTGGA 
CTCTTTCGAGO^GCIXXAAGCGGAGGAGGATGAGAGGGAAGTATCCGTrCCGGC 
GGAGATCXTGCGGAGGTCCAGGAAATIXXCrCGAGCGATGCCCATATGGGCACG 
5 <XCGGATTACAAC(XrcCACTXnTAGAGTCXnGGAAGGA(X(XK3ACTACGTC^ 
CCAGTGGTACACGGGTGTCCATTGCCGCCnXKX^AAGGCCCCrcCGATACCAC^ 
CACGGAGGAAGAGGACGGITGTCCIGTCAGAATCTACCGTGTCTICTGCCI^^ 
GGAGCTCGCCACAAAGA(XrrcGGCAGCKXX3AATCGT^^ 
CACXKK^AACGGOJWmxnGACCAGCXXriXXGAra 

10 CXnTGAGTCX3TACTCCrcCATGCaXC(XTrcAGGGGGAGa^^ 
CK:AGCGACGGGTCnTGGTCTAOCGTAAGajAGGAGG<nAG^ 
GCTCCnx:GATQT0CnACACAtXK3ACAGQC^ 
GGAAACCAAGCIXKCXATCAATGCACTOAGCAACntrnTCC^^ 
TTGGTCTATGCTACAACATCrcGCAGCGCAAGCXnGCXKK:^ 

15 lTGACAGACrGCAGGTOCrrGGACmCX:ACTA<XXK^ 

AGGCGAAGGOGTCCACAGTTAAGGCTAAACTICTATCCGTGGAGGAAGCCT^ 
AGCTGA<XKXXXX^CATrcXKKX::AGATCTAAATIT^ 
CXXSQAACCTATOCAGCAAGGCXXm'AACCACATO^GCrC^^ 
CIGGAAGACA(nGAGACA(XAATroACA(XA(X:ATCATGGCAAAAAATGAGGCT 

20 TTCTGCGTCCAACCAGAGAAGGGGGGCCGCAAGCCAGCrcGCCITAT^ 

CAGATTTGGGGGTrcGTGTGTGCGAGAAAATGGOCXTITACGATGTGGTCT^ 
(XTCCXnCAGGCOnGATGGGCnxzriTCATACGGAT^^ 
CGGGTCGAGTKXnGGTGAATGCCTGGAAAGCCAAGAAATGCCO'ATCG 
CATATGACACXXGCnX3mTroACKlAACXKnt:ACTO 

25 GGAGTCAATCTACCAATGTIGTGACTK3GCCXXXGAAGCO^GACAGG^ 
GTCXKJTCACAGAGCXKKnTrACATaKKKKKX^ 
AACroCGGCTATCGOCGGTGCCGCGCXJAGCXKmS^ 
AATACXXrrcACATGTTACTTOAAGGCXXKnGCXSGCOXj^ 
AGGACTGCAaJATGCrcGTATGOMiAGACGACXnTGTC^ 

30 GGGGA«X:AAGAGGAaiAGGCGAG<XTAOGGGCXrrTCy^CGGA<^^ 
ATACnCIXKXXX:CCCrGGGGA<XXXKXX:AAA(X:AGAATAC»A<^^ 
ACATCATGClXXriXXAATGrGTCAGTCGOGCACGATGCAl^^ 
ACTATCTCAOXGTGACCXXLfVCCACmXXnTGKXJ^^ 
TAGACACACIXXAGTCAATIXXTGGCTAGGCAACATCATCATGrATGKXKrC^ 

35 TTGTGGGCAAGGATGAIXXTOATGACnXIAlllUriCTa^ 

ACAACrrGAAAAAGCCCTAGATK3TCAGATCTACX3GGGCX:TCTrAC^^ 
CCACnGACCnACCTCAGATCATrcAACGACI^^ 

ccatagttactctcx:aggtgagatcaatagggtggcti^ 
ggggta<xx3<xxntgotagtctogagacatcxkkkx:aga^ 

40 CTACIGTOCCAQGQGQGGAGGGCKjCCACrroTCGC^ 
CAGTAAGGACCAAGCTCAAACrrcACIXXAATCXXXK^^ 
ATCCAGCIGGTTCGTIGCrGGTrACAGaKKKSGAGACATATATC 
CGTGOXGACmXKJIGGTTCATfflGGTGCCTACTt^ 
CATCTATCTACIXXCX:AACXX5ATGAACX3GGGAaTrAAACA 

45 CCATCCTOTITITriXXCTmT^^ 

TriTCT(XTriTrrriictnuriirrriu;iTiiurri^ 

CXXHAGTCACXXKn'AGCTGTGAAAGGTCXXrraAGCJCX^ 
TGATACnKKXnUTCTGCAGATCAAGT 

50 

SE0IDNO:24: Nuckotide sequence of fidMengfli HCV ciDNA ckne containing fbs 
mutattoa that results in Ser to He at position 1179 of SEQ ID N0:3 (see Figote 9) 
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GCCAGCCCmSATlXKKKKKXJACACTOCAC^ 

TACTGTCrrcACGCAGAAAGCGTCTAGCCATGGCGTrAGTATGAGTGTOT 
CCTCCAGGA(XCCC(XrrcCCGGGAGAGCCATAGTGGTCIX3CGG 
CACCGGAATTGCCAGGACGAaXKKn€CTITCTKK5ATCAA<XC^^ 
5 GGAGATITGGGCGTGCCCCCGCGAGACTGCTAGCCGAGTAGTGITGGGTCGCGA 
AAGGCOTGTGGTACIGCXTroATAGGGTGCrKJCGAG^ 

AGA(XGTGCA(XATGAGCACX3AATCCTAAACCrCAAAGAAAAACCAAACGTAAC 

ACCAACCGCXXK:CCACAGGACGTCAAGTr(XOGGGCGGTGGTCAGATCGTCGGT 

GGAGTTTA(XTGTKKXGCXKIAGGGGCCCCAGGTTGGGTG1^^ 

10 AGACITCCGAGCXJGTOGCAACCTCXSTGGAAGGCGACVVACCTATC^ 

GCX:AGCXXGAGGGTAGGGCOXKKKnCAG0(XGGGTA(XXXTCG0CC^^ 
CAATGAGGGCTTOGGGTGGGCAGGATGGCrCCTGTCAOCXX^ 
AGTTCGGGCXXXIACXKiACXCOaKKXjTAGG^ 
ATACCXJTCACGTmKKnTOGCCGATCrc^^ 

15 CCXXCTAGGGGGCXKTOOCAGGGCCXTOGCGCATQGCXn^ 
CXKKXnXSAACTATGCAACAGGGAATCnXXXXKjTIXK^ 
TGGCmXKnXSTCCTGTITGACXiATCX^ 
TCXXKSAGTGTACXIATGrcACGAAaM^CTCCTa:^^ 
CAGaK3ACATGATCATGCATAaXXXXXK3TGCX3TGC^^ 

20 ACKXnx:CCX3CKKJIX3GGTAGCXKnx:ACTCXX:AC^^ 

CGTCXXCACTACGACGATACXSACXKXATGTOGAlTnK^^ 
CnxntXnXXXKn-ATGTACXSTGGGAGATCIUKKXKSAT^ 
GCTGrnCACCTTCTOCXXnXXXCGGCACGAQAC^^ 
ATATATCmKSCCAOGTGACAGGTCACaSTATGGCnTGGGAT^^ 

25 GGTCACOACAGCAGCCXTAGTQGTATCXKIAGTrACTCCXK} 
anXK5ATATGGTGGC!GGGGG0a}ATIXKKKTAGTCCT^ 
TCCATGGTGGGGAACIXKKKn-AAGGTTCTOATIXnXJAT^ 
TTGACXKKKKSAAOnATGTGACAGGGGGGACGATGGCCAAAAAC^ 

TrACxntxx:riciTnt:A(xx:GGGTCATC^^ 

30 CX3GCAGCKKK;ACATCAACAGGACnX3CXXnGA^ 
GGGTXaJTTCCIGCXKnmTCTACmX^ 
AGCXKJATQGCCAGCnXK^AGCCCCATCGACXXXmx:^^ 
TCACTTACAATGAGTCACACAGCTCGGACCAQAGGCXnTATKn^ 
ACCXXXKXXXnXKX3GTATCGTA<XCXK:GGCXX:AGGTGTGTGG^ 

35 TrcA<X<XAAGC(XrKnx:GTGGTGGGGACXjACCGAC(XK^^ 

ACAGTIGGGGGGAGAATC^GA(XK3A0GTGCl<KTItTrAACAACAC^^ 
OSCAAGGCAACIXKmTGGCKnACATGGATGAATAGCAC^^ 
GTGaKXKKKXmXXnxnV^CATOKKJG^ 
CCCCACXKSACTGCrno^GGAAGCACCmSAGGCJ^^ 

40 GGGanTXKnTGACACOCAOATGCTKKnXXA 

A(xccixx;ACKnx:AAcnTrAccATcrrrcAAGGi^ 

GGAGCACAGGCItX3AAGCCXK:ATGCAATIXK3ACTt»AGG 
GGAGGACAGOGACAOATCAGAGCrrAGCXXXXrrGCIGCnST^ 
GCAGGTATKKXX7R3TrOCriTCACX:AOCCTAOCX3GC^^ 
45 ATCrcCATCAGAAOGTCOTtjGACXn'ACAATACXnXn'ACXKj 
. im-CTCCirroCAATCAAAT^^ 

ACGCXKXiCXntnXjTGCXnXKTIXntKSATOATGC^ 
OKXXZTAGAGAACCroGTGGTCCrcAAOGCGGCAT^^ 

CATiciurccrTCx:nxxmnT^^ 

50 TCOCTtKKKKXKKlATATGCXXnxnACXXK^ 

GOm'ACXiiACXilACGAGCATAaKX^ATGGAO^^ 

GGCGCxKnrrixxnAGGTCTCATAcrrcriTGAa^^ 

CCKXKn-AGGCrcATATCGTGGTTACAATATITrAT^^ 
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TTGCAAGTGTGGATOCCCCXXXnXilAAarn^^ 
TCCrCACmXK:GOGATC!CACCCAGAGCrAATCTITA<XATCACCAA 
G(X:ATA(naKnCX:ACrCATGGTGCn€CAGGCnXKJTATAA(X;AAAGIT 
TCX3TGCGaK:ACACGGGCTCATTCX3TGCATGCATGClXjGT^^ 

5 GGGTCATTATGTCKAAATGGCICrcATGAAGTT^ 

GTrTATGACCATCrcA(XCX:ACnmK3GACTXKKKX:CACG<X^ 
TTGCGGTGGCAGTTOAGCXXXSTanxnTCIX^ 
CTGGGGGGCAGACACXXKXKKXnmGGGGACATCATCri^^ 
GCXXGCAGGGGGAGGGAGATACATCTGGGACXXKKJAGACAGCCTTGAAGGGCAG 

10 GGGTGGCGACIXXnXZXK^GCXnATTACGGCXTAC^^ 
TKKSCTGCATCATCACrAGCOCAC^ 
AGGTCX:AAGTGGTCTOCAaXK:AACACAATCTrT^^ 
CXSTGTGTIGGACKnOATCATXKnXXXJGGCrcAA^ 

GGCXXAATCAOCCAAAimACACX^TQTGGACCAGGAajrcGTCGGC^^ 

15 GCXKXXXXXXKKXKXKXmCCTroACACX:ATGCACmXXK^ 
ACTKKjTCACGAGGCATGIXGATGTCATIXXXKjTG^^ 
GGGGGAGOCTACICimXX:AGGCO0GTCTCCrAC^ 
TCCACrGCICIGCOOCrCXKSGGCAa^^ 
AC<XX3AGGGGTKKX}AAGGCX3QTGGACTITGTAaXC^^ 

20 ACTATGOKnmXXKnCITCACGGACAACT^^ 
CAlTCX::AGGTCGC(XATCTACACGCaxnACTGGT^ 
GCOGGCriX3CX3TATGCAG<XX::AAGGGTATAAGG^ 
GCCX3a::A(XCTAGGTITCGGGGCGTATATGTCTAAGGC:A<^^ 
ACATCAGAA(XGGGGTAAGGACCATCA(XACGGGTGaXXX:ATCACGTACn^ 

25 anATGGCAAGTTTCTTGOCXJACXKnGGTTGCTCr^^ 
AATATGTGATOAGTGCX^ACTCAACraACrcGACX^ACT 
GTCCIXjGACCAAGCGGAGAaKKn-GGAGCGCGACI^ 
ACGCOXXXKjGATaKnCACCGTGCXJACATCCAAACATCGAGGA 
CX::AGCACrGQAQAAATCXXXnTrrATGGCAAAGCCA 

30 GGGGGGGAGGCACXJTCATITICIXKXJATKX^i^ 

Ga5AAGCrGTC0GG<Xnx:GGACrcAATGCTCTAGCATATTACa^^ 
TATCXZXjTCATACXAACrAGCGGAGACXnX^ATTXnXX^ 
OA<XK3GCITrAOCGGCGATITCX3ACrcAGTGATCGACIGCAATAC^^^ 
CAGACAGTCGACTrcAGCXnXK3ACXX»ACOTCACX:ATraAGACGACX5AC^^ 

35 CACAAGA(XKX3GTGTCAaKrrcGCAGCGGCXjAGGCAGGA<n^ 
TGGGCATTTACAGGTITGTGACIXXAGGAGAAOGGCCCIXXK^ 
CKXK3TIOXnGCX?AGTGCrATGA0GC^^ 
GCXGAGACCrcAGTTAGGTKKXKKSCTrACCTAAACACACCAG 
GCXiAGGACCAlUKKSAGTIOXKKSAGAGCXnCITrACA 

40 aXXXATircrmntXXAGACTAAGCAGGCAQ^^ 

GCATAlXAGGCTACGGTGlXKXKXAGGGCrCAGGCTO::^ 
AAATGTGGAAGTGlUrc^TACGGCTAAAGCO'ACXKIIX^ 
G<nGTATAGGCIXK3GAGCXXnTCAAAA0GAGGTrACTA(X^C^ 
AAATACATCATGGCATGCATGTCXKKTOA<XTGGAGGTCGTCACGAGCAanX^ 

45 TGCTGGTAGGCGGAGTCXJrAGCAGCIXnGGCX:GCm'AT^^ 
On'GGTCATIXnXKKSCAGGATCATCITGltXXM 

AGGGAAGTCCirrACCGGGAGTlCOATGAGATGGAAGAGTCaKXl^ 
CTTACATCGAACAGGQAATGCAGCrcGOCXSAACAATI^^ 
GGTIXKnX3CAAACAGCX::A(XAAGCAAGaKjAGGC^^ 
50 CCAAGTGGOKJACXXnXXSAAGCCITCrKKK^ 

CGGOATACAATATITAGCAGGCITGTOCACrCTXKXr^^ 
TCACroATGGCATTCACAG<XriCTATCACX:AGCCCX3^^ 
TCCTGTITAACATCCTOGGGGGATGGGTGG<XGC(XAACTIXjC^^ 
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TGCTTCTCCrnajTAGCKXKXXKKIATC^^ 
CITCGGAAGGTGCITGTGGATATITrGGCAGGTrATG^ 
CGCTCGTGGCXnTrAAGGTCATGAGCGGCX5AGATGC(XTCCA(XX3AGGACCT 
TAACCTACTC<XrKKn"ATCXJr(:nXXXXJIXKKXXXXn"AGTC 
5 CAGOJATACTGCGTCXKK^ACGTGGGCCX^AGGGGAGGGGGCrGTGCAGTGGATGA 
ACCGGCTGATAGCGTTCGCrrCGOGGGGTAACCACGTCrrCCCCCACGCACTATGT 
GiXTOAGAGOjACGCraCAGCACmGTCACrcAGATCC^^ 
ACTCAGCTGCTCAAGAGGCITCACCAGTGGATCAACGAGGACTGC^ 
GCTmKKTKXmKMn'AAGAGATGTITGGGATrGGAT^ 
10 TTTCAAGACCTCGCKXAGTCCAAGCrcCTGCOGCXjAl^^ 
TtnXZATGTCAAOnGGGTACAAGGGAGTCTGGCXKKKK^ 
CX:ACCIXKXrATGTGQAGCACAGATCA<XXK5ACATGTGAAA^ 
GGATCXnXK3GG<XTAGGA<XTGTAGTAACAa3TGGCATGGAACATlX^^ 

aK:x3TACACx:AaKKKXXxnxK:A(XMXxrK^^ 

15 CimXKKXK3GTGGCnXXnX3AGGAGTACGTGGAGGTrA(XKXXM?^^ 

CACTAOnXSACGGGCATGACXACnGACAACXn'AAAGl^ 

CXXCXXSAATICrrcACAGAAGTCGATXKKKnXKXK^ 

GTGCAAACXXXJrCCTACGGQAGGAGGTCACATIXXrnKiTO^^ 

CriXXniXKXmiACAGClXXXATGCGAGCCCGAA^ 
20 CX;ATGCrcA<XGA(XCCTC(XACATTACGGCGGAGAOGGC^^ 

CAGGGGATCIXXaXXTTCCnTGGCCAGCTC 

COT0AAGGCAACATGCACTACXXGTCATGA(nXXXXXK5AC^^ 
GGCX:!AACX:nXXTGlXK3<XKK:AGGAGATGGGa^ 
AGAAAATAAGGTAGTAATTTTGGACTCriTIXX3AG<XX3C^^ 
25 TGAGAGGGAAGTATC(XmOXKKXKjAGATCXnXKXK}AGGl^ 
TCGAGC»ATGCCCATATGGGCA(X3C(XXK3ATTACAACCCI^ 
lXK3AAGGAC(XXK3ACTACGriXXOXX:AGTGGT^ 

. CCAAGG<XCCIXXGATA(X:ACXn'CCAaK}AGGAAGAGGAa3GTrGT^^ 

AATCTACXXnOTCTIXnCXXintKKXKSAGCI^^ 
30 CX3AATCGTCGGCCXnx:GACAGCXK]K:A(XKK:AA^^ 

GACGAtXKKXlACGCXKKSATCCGACGTTGAGTCCT 

AGGGGQAGCXXK3GGGATCXX»ATCTCAGCGACXKKj^^ 

AGGAGGCrAGTGAGGACGlCGT(nGCTGCTCGATGT(XTACACAT^ 

CCTGATCACXKX:ATGCGCroCXK3AGGAAACCAAGCTGCa:ATCAAT^ 
35 CAAClXnriGCTCCXSTCACXACAAC^^ 

AGCXnXKXjGCAGAAGAAGGTCACXnTIGACAGACTGCAGGTCO^ 

TA(XXKj«5ACX3TGCrCAAGGAGATGAAGGOGAAGGanCCAC^ 

CTKJTATCXXnWAGGAAGOCIGTAAGCnGACXKXXX^ 

AATTIGGCTATGGGGCAAAGGACmCCXKSAACCTATtXA 
40 ACATCXXKnXXXnGTGGAAGGACTroCIGGAAQAC^ 

(X:ACCATCATGGCAAAAAATGAGGTITKnX3a3TCCAACCAGAGAAGGGGGG^ 

GCAAG<XAGCIXXKXriTATCGTATTCXX::AGAT^^ 

AATGGaXJJTTACGATGTGGTCrnXACCXTItXX^ 

ACXK3ATrcCAATACnOXXnXK5ACAGCX«3GTCGAG^ 
45 AGCXiAAGAAAlXXXXTATGGGCnTCXK^ATATGACACCXGCT^^ 

GTCACroAGAATGACA.TCCmGTrcAGGAGTCAATCTAC^ 

OCXXXX}AAGCX:AOAC:AGGCX:ATAAGGTCXKnx::A^ 

GCXXXXJTOACTAATICrAAAGGGCAGAACKKXKSCrATCXKXXK^^ 

CGGTGTACTCACX5ACXLVGCIXK3GGTAATA<XCTCACATGTrAC^^ 
50 GCGGiXriXnxZGAGCTGaSAAGCTOCAGGACKKIACGATG^ 

GACXnTGTCXmATCKnCAAAGCXKXKKKj^^ 

CXjGGCXTTCAlXKjAGGCTATGACTAGATACTCro^ 

AACCAGAATAajACITGGAGlTGATAACATCATGCrc^ 
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GCAaSATGCATCKKXJAAAAGGGTGTACTATCTCACXCGTC 
CTTGCGa3GGCIX3CGTGGGAGACAGCTAGACACACTCCAGTCAATT^^ 
GCAACATCATCATGTATGCGC(XACCTlXnXKKK;AAGGATGAT(XTQATGAC^ 
TlTCnurcrATCCTTCTAGCTCAGGAACAAC^ 
5 TCTACGGGGCCTGTrACTXXATraAG(XACTroACCTA<XTCAGA 
ACT(XATGGCCTrAGCGCATITrcACrcCATAGTrACK^^ 
GGGTGGCTrcATGCO€AGGAAACriTGGGGTA(XGCCCTI^ 
TOGGGCCAGAAGTGTCCGCGCTAGGCTACTXSTCm^GGGGGGGAGGGCnX^^ 
TTGTGGCAAGTAO^TCTrcAACnXKKK^AGTAAW 
10 ATaXGGCKK:QTCCCAGTTGGATlTATCCAGCKHmx:GTI^^ 
GGGAGACATATATCACAGCCTXnXJTCGTGCCCXJACCCJCGC^^ 
CTACrCCTACTITCrrGTAGGGGTAGGCATCTATCT^ 
GGA(XTAAACACTCCAGG<XAATAGGCCATCCKmTITIT^ 

TiriiTiTimTiTriiimiTmiTiTi'iuKXjmi 

15 TIXnTKXnTrGGTGGCICCATClTACKX^ 

GTGAGCXXKnTGACTGCAGAGAGTGCKSATACTGGCXnXJrC^^ 

SEQIDNO:25: Nucleotide sequence of DNA clone of HCV additive replicim STITR- 
EMCV/HCVrepVn 

20 

GCX:AGCXXX:CGATrGGGGGCGACA<ntXACCATAGATCACnXXa^^ 

TACTCTCrrcACXK^AGAAAGCXnurAGCCATGGC^ 

antX;AGGACmXXXnCCCXKKJAGAG<XATAGT^^ 

CA(XGGAATTG(XAGGAa3A(XXKKnXXTITCTIXK3ATCA^ 
25 GGAGATTTGGGCGTGOCmXXXUGACrGCTAGCCGAGTAGTGTTGGC^^ 

AAGGCCTIXmKnACKKXTOATAGGGTGCITGCGAGT^^ 

AGACXimXKrACCAGACX^ACAACXKnTnXCICTAGC^ 

TaxnXXCXXXXXXX:TAACGTrACTGG<X»AAGCCGCIT^ 

QC«TrrGTCTATATGTrATTTrcx:AOC^^ 
30 CXXKjAAACCTGGCXXTCTCrnnTQACXjAGCATTOT 

CCAAAGGAATGCAAGGTCrcmraAATGTCXnXSAAGGAA 

CTTCTroAAGACAAACAAanxnGTAGCGACXXnTr^ 

ACXJTGGCXSACAGGlXKXriUIGCXKKXIAAAAGCCAC^^ 

AAAGG<XKK::ACAAO(XCAGTGCX:ACXmX3TGAGTI^ 
35 CAAATGGCICTCXrrcAAGCGTATTCAACAAGGGGCroAAGGAT^^ 

ACCXX^ATKnATGGGATCTCATCTGGGGCOXXMnXK:^ 

AGTCX5AGGTrAAAAAAa?lCTAGGCCXXXX»AACCA0GG^ 

TGAAAAACAa}ATAATACX:ATGGCG<XTATrACXKKXnACI^^ 

AGGCCTACTTGGCKKIAltLVTCACnAGC^^ 

40 qagggggaggtccaagtggtctocacxxk^cacaatctto 
tcaatggcgtgtgttggacixjixjratcatgglx^^ 
cccaaagggcxx:aatca(xcaaatgtaca(x:aatgtggaccag^ 

(nXKKJAAGCGCCXXXXXKKSGOKXmtxnTGACACCATGC^ 
GACCTTTACnTXKjTCACXJAGGCATGCCWATGTCAT^^ 

45 ACAGCAGGGGGAGCCTACTCTCCCCCAGGiXXXiT^^ 
GGGOGGTCCACnX3CnxnXKXXXnXX3GGGCA<X^ 
GTGTCC^CXXXM^GGGGTKKXW^GGCXKiTGGAa^^ 
AAA<XACTATG(X3GTCXXXX3GlXnTCACXK3ACAACri^^ 
GCAGACATTCCAGGTGG<XCATCTACACXXXXXJrACIXK3TAGC^^ 

50 AAGGTQCCGGCTOi^AlXK^AGCCXAAGGQTATAAGGlXXnT^^ 
OCXnCXKXXKX:AO(XTAGGTrraKKKKXnATAT^^ 
(XXn7y^CATCAGAA<XXK3GGTAAGGACX:ATCACCAa3GGT^^ 
CTCCA<XTATGGCAAGTnCTIX3CXX}A0GGT^^ 
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ATCATAATATGTGATGAGTGCCACTCAACnXSACrCGACX^ACTATCCTGa 
GCACAGTCO'GGAaL\AG<XK}AGACXKKnGGAGa3^ 
C0GCTACGCOXXXKK3ATCXKnCACamXX:ACA^ 
TCrGTCCAGCACrKK3AGAAATCO:CITrrATGGCAAAGC^ 
5 ATCAAGGGGGGGAGGCACXJrcATITIOGQ:ATia:AAGAAGAAAT^ 
CrCCCCXSaSAAGCTCTaXKKXTOGGAC^ 

TXGATGTATCCGTCATACCAACTAGOKjAGAajTCATTGTCXjrAG^ 
TCTAATGACGGGCirrACCGGOSATITCXSACTCAGTGATCGACI^^ 
GTCACXXJAGACAGTCXJACnTCAGCCIXMACmSACC^^ 
10 CCGTGCCACAAGACXKXSGTGTCACGCrcGCAGCXKKXJAGGCAGGACT^ 
GCAGGATGGGCAirrACAGGTTimGACICCAGGAGAACGGCX^^ 
(XSATTCXnX^GGTTCKntXXSAGTGCTAT^^ 

ACGC<XGCXXjAGACOX::AGTrAGGTroCGGGCITACCTAAACACA(XAGG^ 
CXX]nrCrro«:AGGA<XAT(nXjGAGTTC^^ 

15 CA.TAGAaK;CCATITCriTGTCCC^GACrAAGCAGGG^ 
CnXKn-AGCATACXIAGGCTAaSGTCTGaK^ 
GGGACCAAATXnXK3AAGTGTCTCATACXKKTAAAGa]TAC^^ 
CGCCCCTGCTGTATAGGCIXKKSAGCmrrcAAAACXSAGGT^^ 
CATAACCAAATACATCATCGCATGCAlXnxXKKnGAO^^ 

20 ACXnXXMSTGCIGGTAGGCXKiAGTCCTAGCAGCrC^^ 

CAGGCAGCGTGGTCATTGTGGGCAGGATCATCTIGTtXGGAAAGiXGGCrAT^^ 
TCCmACAGGGAAGTCOTrACXXKSGAGTIXXiAT^ 
ACACX:TCXXTrACATCXjAACAGGGAATGCAGCICGC^^ 
GGCAATCGGGTrcCTGCAAACAGCCA<XAAGCAAGCXK3AGGCIXKnXK^^ 

25 GGTGGAATCCAAGTCGCXKjA(XCia3AAG<XTIXnX3GGajAAGC^ 
TTTCATCAGCXXKJATACAATATrTAGCAGGC^^ 

GC»ATAGCATCACTGATGGCAlTCACAGCXTCrATCACCAGaXXKnCAC^ 

AACATACCCTCCrarnAACATCCIGQGGGGATGGGTGGC^^ 

TCXCAGCGCTGCITCTGCnTCXnAGGCGCXX^ 
30 AGCATAGGCCrroGGAAGGTCKnTGTGGATATTITGGCAGGTTAl^ 

TGGCAGGCGOKnXIXnXKKXTITAAGGlXIATGAGC^^ 

GGACCKKSTTAACCTACntXCroCrATCXnOC^ 

TCQTGTGCGCAGOQATACKKX3TCX3GCAOGTGGGC^^ 

AGlXM5ATGAACXXKKnGATAG(XriTCXKnTO 
35 GCACTATGTGCCTOAGAGCGACGCnKLVGCACGTGTCACrcAGAT^^ 

CTrA<X:ATCACnx:AGCTCCTOAAGAGGCITCACCAGIXK}Alt^ 

eCACXKXATGCItWKKJTtXmKKn"^ 

GTTOACTCATTTCAAGACXJIXKKTOCAGTOCAAGCI^^ 

GTCCCCTIXnTCrcATGTCAACXmSGGTACAAGGGAGTCK^^ 
40 TCATGCAAACCACXnGCCCATGrrGGAGCACAGAT^ 

GTIXXATQAGGATCmX3GGG<XTAGGA0CnGTAGTAACA<Xn^ 

<X0CATrAAaK:X3TACACX:A0GGGCXXXTOCA0GC^^ 

TCTAGGGC«CIX3TGG0GGGTGGCTGCTOAGGAGTACGTGGAGGTrAC^^ 

GGGGATITCCACTACXnGACGGGCATGACCACIGACAACGTi^ 
45 AGGTTCCGGCCXXXXJAATIXnTCACAGAAGTGGATGGGGTGa^^ 

CGCIXXAGCGTGC7USA<XXXritXTACXKK5AG^ 

AATCAATACCroGTIGGGTCACAQCrOOCATGC» 

GTGCTCACITCCATGCnCACXX3AO0ant^^ 

GTAGGClXKKX:AGGGGATCrnXXXXCTCXnTGGC^ 
50 GTCrGCG(XTIXXJrrGAAGGCAACATGCACTACCa3T^ 

GACCrCAT<X?AGG(XAA(XTCCIGTGGC GGCAG GAGATGGGCGGGAACATC^ 

aKX5TGGAGTCAGAAAATAAGGTAGTAATTTIXK3ACr^^ 

CGGAGGAGGATXM^GAGGGAAGTATCOHTXXKKKXMJAGATCXnX^^ 
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GGAAAlTXXCTCGAGCGATGaXATATGGGCAOKXXXKjATrAC^ 
GTTAGAGTCCTGGAAGGAOCCXKjACTACXmXXnXXIAGTXK^ 
TKKX:GCOX3CCAAGGC(XCim3ATAa:^CCIXX 
TCXJrGTCAGAATCTAOXTCTCTTCI^^ 

5 GGCAGCnXXGAATOGTCGGCCGTCGACAGCGGCACGGCAAaKK^ 

AGCXXTCX:GACX3AaKKmCGCXKK3ATCXXf ACGTTOAGTC^ , 
C(XCCTTGAGGGGGAGOCGGGGGATCXXX}ATCrcAGCGACGGGTCITGGTCrACC 
GTAAGCXJAGGAGQCTAGTGAGGACXJTCGTCTCCIG^ 
CAQGaKXX:TOATCAC!Ga:ATGCGCrGCGGA 

10 CACnXjAGCAACTCTTTGCTCanCAa^ACAAC^ 

AGCXKIAAGCOXKXKK:AGAAGAAGGTCACOTroACA^ 
QAOCACTAOCiGGQACXHGCrcAAGGAGATGAAQGCGAAGGCXH^ 
GCTAAACTICTATCCGTGGAGGAAGOCTCTAAGCrGACGCXXCCACATI^^ 
GATCTAAATTIXjGCTATGGGGCAAAGGACGTCCGGAACCTATCCAGi^ 

15 TTAACX^ACATCXXKntXMTOTGGAAGGACITGCTGGAA 

TGACACX:ACCATCATGG(:7WUUWUVTaAGGTITrcTG0GT<XAAC^^ 
GGGCCGCAAGCCAGCTCXKXTrATCGTATim^AGATTI^^ 
GAGAAAATGGCXXJITrACGAimXKnXJKX:AC(XTC^^ 
CITCATACGGATTCCAATACTCrcCIGGACAGCXK^^ 

20 CnXK3AAAGa3AAGAAATG<XCTATGGGCTIXXK:ATATGACA(XXX 
TCAAOKjTCACnGAGAATGACATCXXnxnTGAGGAGTCAATCT 
ACnXKKXXXXXxAAGCCAGACAGGCCATAAGGTCGCTCACAGAGCGG 
CGGGGGCC<XCTGACTAATTCTAAAGGGCAGAACrGCGGCTAT^ 
GCCAGCXKmnACTOACGACCAGCnmKnAATAaxr^ 

25 aXKJTGCGGOnXSTCGAGCnXKXSAAGCrcCAG^ 

agacxsaccttgtcottatckngaaagakxkkkjac^^ 
cxjtacxkxkxnrcacxkjaggctatgactagatact^^ 
ccx:aaacx:agaatacjgacitggagttoataacat(:^^ 
tojcccacmtqcatctcgcaaaaggqt^ 
30 ccxxjctkxxkxkkscroogtgggagacagctagacacac^ 

CTAGGCAACATCATCATGTATGCGCCXIACOTGTGGGCAAGGATGAT^^ 

CTCATTTCITCTCCATOC^^ 

CAQATCTACXKKKKXnGITACnXXATTOA^ 

AACX3ACKX:ATGG0CrrAGCGCATTTrcAC^^ 
35 AATAGGGTGGCTTCATGCOCAGGAAACmKKKn'Ai^^ 

GACATCXKKKXAGAAGTGlXXXiCXKiTAGGCTACn^ 

ODACITGTGGCAAGTACXTnjITCAACnXKjGCAGTAAW 

TCXZIAATCXXXKKTOOjim^AGTKKJAT^ 

GOK^kKKiAQACATATATCACAGCXJimXJK^^ 
40 GTOCXnACTCCTACTnUimAGGGCTAGGCATCTATCT 

CXKKKJACCTAAACACICCAGGCCAAT^GCCAT^ 

mririiiTirrnTimTrmrriiiiiiiiiiicixxrrrm'r 

CXTmxnrrCCTITGGTGGCTOCATCTrAGCCCT 
GT0aHX3^AGCaKTIGACnXK:AGAGAGTGCIXl^^^ 
45 GT 
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What is claimed is: 

1. A polynucleotide comprising a non-naturally occurring HCV sequence that is 
capable of productive replication in a host cell, or is capable of being transcribed into a non- 
naturally occurring HCV sequence fliat is capable of productive replication in a host cell, 
wherein the HCV sequence comprises, &om 5' to 3' on the positive-sense nucleic acid, a 

5 functional 5' non-translated region (5* NTR); one or more protein coding regions, including at 
least one polyprotein coding region that is capable of rq>licating HCV KNA; and a Amctional 
HCV 3' non-translated region (3* NTR). 

2. The polynucleotide of claim 1, further conqirising an adaptive mutation. 

3. The polynucleotide of claim 2, having a transfection efSdency into mammalian 
cells of greater than 0.01%. 

4. The polynucleotide of claim 3, wherein tiie transfection efiBciency into mammalian 
cells is greater than 0.1%. 

5. The polynucleotide of claim 3, ^dierein the transfection efficiency into mami^^ 
cells is greater tiian 1%. 

6. The polynucleotide of claim 3, wherein the transfection eflSciency into mammalian 
cells is greater than 5%. 

7. The polynucleotide of c}aim 2, wfaerem the polynucleotide is capable of 
rq>lication in a non-hqmtic cell. 

8. The polynucleotide of claim 7, wherein the non-hepatic cell is a HeLa cell. 

9. The polynucleotide of claim 2, wherein the HCV is impaired in its ability to cause 
disease, establish chronic infections, trigger autoimmune req>onses, and transform cells. 

1 0. The polynucleotide of claim 2, wherein the polyprotein region comprises an 
NSSA gene tiiat is not a wild-type NS5A gene. 

1 1 . The polynucleotide of claim 10, wherein tiie NSSA gene comprises a mutation. 
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12. The polynucleotide of claim 1 1, wherein the mutation is wifhin SO nucleotides of 
an ISDR or includes the ISDR. 

13. The polynucleotide of claim 12, wherein the mutation is witiun 20 nt of the 
ISDR» or includes die ISDR. 

14. The polynucleotide of claim 1 3, whoein the mutation encodes an amino acid 
sequence change selected fiom Ihe groiq> consisting of Ser (1179) to lie, Aig (1164) to Gly, 
Ala(l 174) to Ser, .Ser(l 172) to Cys, and Sei(l 172) to Pro of SEQ ID NO:3. 

15. The polynucleotide of claim 11, wherein the mutation conqnises a deletion of at 
least aportion of the ISDR. 

16. The polynucleotide of claim IS, wherein the mutation comprises a deletion of the 
entire ISDR. 

17. Hie polynucleotide of claim 16, wherein the mutation comprises a deletion of 
nucleotides corresponding to nucleotides 5345 to 5485 of SEQ ID NO:6. 

1 8. Hie polynucleotide of claim 1, wherein the polynucleotide comprises at least one 
IRES selected fifom the group consisting of a viral IRES, a cellular IRES, and an artificial 
HUBS. 

19. Thel>olyn\icleotideof claim 18, wherein the HCVpolyprotem 
encodes all HCV structural and nonstructural proteins. 

20. The polynucleotide of claim 19, further-comprismg a foreign gene qperably 
linked to a first IRES and tiie HC^ polyprotein coding re^ 

IRES. 

21. The polynucleotide of clann 18, wfaa:einttiepoly^ 
incapable of makii^ infectious HCV particles. 
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22. The polynucleotide of claim 21, iwlieidnlliepol^^ 

a mutation and/or a deletion in tibe stnictural protein coding region. 

23. The polynucleotide of claim 22, furflier comprismg a fordgn gene operably 
linked to a first IRES and the HCV polyprotein coding region operably linked to a second 
IRES. 

24. Ilie polynucleotide of claim 23, ^dieieha the fordgn gene is a gene encoding a 
selectable maiker or a reporter ^e. 

25. The polynucleotide of claim 24, finlfaer comprising an adaptive mutation. 

26. The polynucleotide of claim 25, having a transfection eiGBciency into mammalian 
cells of greater than 0.01%. 

27. The polynucleotide of claim 26, wherein tiie transfection efSciency into 
mammalian cells is greater than 1%. 

28. The polynucleotide of claim 26, wherein the transfection efScieni^ into 
mammalian cells is greater than 5%. 

29. The polynucleotide of claim 26, wfaerem the transfection efiSdeoc/i^ 
mammalian cells is about 6%. 

30. Ibe polynucleotide of claim 25, wherein the polynucleotide is capable of 
replication in a non-hqpatic cell. 

31. The polynucleotide of claim 30, ^Kdserein the non4iq>a& 

32. The polynucleotide ofclaim 25, wherein the HC^^ is inq)aired in it^ 

cause disease, establish chronic infections, trigger autoimmune responses, and transform cells. 

33. Ibe polynucleotide of claim 25, wherein the polyprotein region conqxrises an 
NS5 A gjsne that is not a wfldrtype NS5A gene. 
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34. Hie polynucleotide of claim 33, wherein die NSSA gene comprises a mutation. 



35. Hie polynucleotide of claim 34, wherein the mutation is within 50 nucleotides of 
an ISDR or includes flie ISDR. 

36. The polynucleotide of claim 34, wheiein flie mutation is within 20 nt of the 
ISDR, or includes Ae ISDR. 

37. The polynucleotide of claim 36, wherein the mutation encodes an amino acid 
sequence change selected 6om the group consisting of Set (1 179) to He, Aig (1 164) to Gly, 
Ala(l 174) to Ser, Ser(1172) to Cys, and Ser(1172) to Pro of SEQ ID NO:3. 

38. Hie polynucleotide of claim 34, wherdn the mutation comprises a deletion of at 
least a portion of the ISDR. 

39. The polynucleotide of claim 38, wherein the mutation comprises a deletion of tiie 
entire ISDR. 

40. Hie x>olynucleotide of claim 39, wherein the mutation comprises a deletion of 
nucleotides CQrreqxmding to nucleotides 5345 to 5485 of SEQ ID NO:6. 

41. Hie polynucleotide of claim 24, wfaeiein: 

(a) Ae first KES is an HCV IRES; 

(b) tte foreign gene is a neo gene; and 

(c) fte second IRES is a EMCV IRES. * 

42. Hiepolynucleotideofclaim41, herein the HCV sequence is a genotype 1^ 
sequence. 

43. The polynucleotide of clahn 42, wfaeiein the HCV sequence is subtype lb. 

44. The polynucleotide of clahn 41, comprismg SEQ ID NO:5 or SEQ ID NO:6. 



45. The polynucleotide ofclaim 41, further comprismg an adaptive mu^ 
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46. The polynucleotide of claim 45, having a transfection efBciency into mammalian 
cells of greater than 0.01%. 

47. The polynucleotide of claim 46, wherein the transfection efiSciency into 
mammalian cells is greater than 1%. 

48. The polynucleotide of claim 46, wheiein the tnmsfection efBciency into 
mammalian cells is greater than 5%. 

49. The polynucleotide of claim 46, wherein the transfection efiSciency into 
mammalian cells is about 6%. 

50. The polynucleotide of claim 45, wherein &e polynucleotide is cq)a^ 
replication in a non-hqmtic cell. 

51. The polynucleotide of claim 50, wherein the non-hqMitic cell is a HeLa cell. 

52. The polynucleotide of claim 45, wherein the HCV is impaired in its ability to 
cause disease, establish chronic infections, trigger autoimmune responses, and transform ceUs. 

53. The polynucleotide ofclaim 45, wherein the polyprotein region comprises an 
NS5 A gene that is not a wild-type NS5 A gene. 

54. The polynucleotide of claim 53, who^ein die NS5 A gene comprises a mutation. 

55. The polynucleotide of claim 54, whoein the mutation is witiun 50 nucleotides of 
an ISDR or includes tiie ISDR. 

56. The polynucleotide of claim 54, who^ the mutation is within 
ISDR, or includes the ISDR. 

57. The polynucleotide of claim 56, ^^leran the mutation encodes an amino acid 
sequence change selected from the group oonsisti]]^ of So: (1179) to He, Arg (1164) to Gly, 
Ala(1174) to Ser, Ser(1172)toCy8.andSer(1172) toPto of SEQIDNO:3. 
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58. Hie polynucleotide of claim 54, wherein the mutation comprises a deletion of at 
least a portion of the ISDR. 

59. The polynucleotide of claim 58, whea:ein the mutation conqmses a deletion of the 
entire ISDR. 

60. The polynucleotide ofclaim 59, herein the mutation conqyrises a del^ 
nucleotides corresponding to nucleotides 5345 to 5485 of SEQ ID N0:6. 

61. The polynucleotide of claim 1, wherein the polynucleotide is douhle-stranded 

DNA. 

62. A vector conpising the polynucleotide of claim 61 cq;)^^ 
promoter. 

63. The polynucleotide of claim 41 wherein the polynucleotide is double-stranded 

DNA. 

64. A vector comprising the polynucleotide of claim 63 operably associated with a 
promoter. 

65. The vector ofclaim 64, fur&er comprising a mutation in the NS5 A gene. 

66. The vector ofclaim 65, ^(dKcrem the mutation is selected 

consistmg of mutations encoding flie amino add changes Ser (1 179) to He, Arg (1 164) to Gly, 
Ala(l 174) to Ser, Ser(1172) to Cys, and Sa{l 172) to Pro of SEQ ID NO:3; and an in fiame 
deletion of nucleotides encoding amino adds comprising at least a portion 

67. The vector of claim 66, wheardn the mutation comprises a deletira 

ISDR. 

68 . The vector of claim 67, wbeietn ttie mutation cooiprises a deletion of nucleotides 
corresponding to nucleotides 5345 to 5485 of SEQ ID NO:6. 



69. A cell c omprisin g tibe vector of claim 62. 
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70. A host cell comprismg flie polynucleotide of claim 2, wherein the host cell is a 
mammalian cell. 

71. The host cell ofclaim 70, ^lerein^ polynucleotide conqpr^ 
mutation. 

72. The host ceUofclaim 71 wfaeremihe host ceU is a human cell. 

73. Thehostcellofclaim72¥Aiei:emfliehostoeUisaUvercell. 

74. ThehostcdIofcIaim72wfaeranthehostcelli8aT-<^lloraB-cell. 

75. The host cell of claim 72 ^vrfiezein the host ceU is a HeLa cell. 

76. A method for identifying a ceU line that is pennissivefcH' infection with H(^ 
comprising contacting a cell in tissue culture with an infectious amount of the polynucleotide 
of claim 1, and detecting iqilication of HCV in cells of the cell line. 

77. A method fix producing a cell line comprising replicating HCV, the method 
comprising . 

(a) transmbing flie vector of claim 62 to synthesize HCV KNA; 

(b) transfectmg a cell wiflilheHCVKNAof step (a); and 
S (c)culturingthecell. 

78. A vaccine comprising fhe polynucleotide of claim 1 in a pharinac^ 
accq>table carrier. 

79. The vaccine ofclaim 78, i^Aerem the polynucleotide furf^ 
adaptive mutation. 

80. The vaccine ofclaim 79, wfaeiem the adaptive mutation coinprises a deleti 
nucleotides ooneqxnidmg to nucleotides S34S to S48S of SEQ ID NO:6. 
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' 8L The vaocine of claim 80, whodn HieHCV is iiiq)aired in its ability to cause 
disease, establish cliroiiic infections, trigg^ autoinmnme responses, and transform cells. 

82. A method of inducing immunoprotection to HCV in a primate, conqxrising 
administering to fhe primate fte vaccine of claim 78. 

83. A method ofindudngtnmoiunqirotection to HCV^ in a primate^ 
admimstering to the primate the vaccine of claim 81. 

84. A method of testing a compound for inhibiting HCV replication, conqnising 

(a) treating the host cell of claim 70 with the conq)ound; 

(b) evaluating the treated host cell for reduced HCY replication, wherein reduced 
HCV rq>lication indicates the ability of tibie conqK)und to inhibit HCV rqilication. 

85. A method of testing a conq)ound for inhibiting HCV infection comprising 
treating a host cell with the compound before, during or after infecting or transfecting the host 
cell with the polynucleotide of claim 1. 
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HCV VARIANTS 
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SEQUENCE LISTIKG 

<110> Rice III, Charles 
Blight, Keril 

<120> HCV Variants 

<130> 6029-7868 

<140> 
<141> 

<150> 09/576,989 
<151> 2000-05-23 

<150> 09/034,756 
<151> 1998-03-04 

<160> 24 

<170> Patentin Ver. 2.0 

<210> 1 
<211> 21 
<212> DNA 

<213> Hepatitis C virus 
<400> 1 

ggcgacactc caccatagat c 21 

<210> 2 
<211> 99 
<212> DNA 

<213> Hepatitis C virus 
<400> 2 

tggtggctcc atcttagccc tagtcacggc tagctgtgaa aggtccgtga gccgcatgac 60 
tgcagagagt gctgatactg gcctctctgc tgatcatgt 99 

<210> 3 " 
<211> 1985 
<212> PRT 

<213> Hepatitis C virus 
<400> 3 

Met Ala Pro lie Thr Ala Tyr Ser Gin Gin Thr Arg Gly Leu Leu Gly 
15 10 15 

Cys Xle Xle Thr Ser Leu Thr Gly Arg Asp Arg Asn Gin Val Glu Gly 
20 25 30 

Glu Val Gin Val Val Ser Thr Ala Thr Gin Ser Phe Leu Ala Thr Cys 
35 40 45 

Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser Lys Thr 
50 55 60 

Leu Ala Gly Pro Lys Gly Pro lie Thr Gin Met Tyr Thr Asn Val Asp 
65 70 75 80 

Gin Asp Leu Val Gly Trp Gin Ala Pro Pro Gly Ala Arg Ser Leu Thr 
85 90 95 

Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala 
100 105 110 
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Asp Val lie Pro Val Arg Arg tog Gly Asp Ser Arg Gly Ser Leu Leu 
115 120 125 

Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu 
130 135 140 

Leu Cys Pro Ser Gly His Ala Val Gly He Phe Arg Ala Ala Val Cys 
145 150 155 160 

Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu Ser Met 
165 170 175 

Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro Pro 
180 185 190 

Ala Val Pro Gin Thr Phe Gin Val Ala His Leu His Ala Pro Thr Gly 
195 200 205 

Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly Tyr 
210 215 220 

Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 
225 230 235 240 

Ala Tyr Met Ser Lys Ala His Gly He Asp Pro Asn He Arg Thr Gly 
245 250 255 

Val Arg Thr He Thr Thr Gly Ala Pro He Thr Tyr Ser Thr Tyr Gly 
260 265 ^ 270 

Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He He 
275 280 285 

He Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr He Leu Gly He 
290 ' 295 300 

Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val Val 
305 310 315 320 

Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro Asn 
325 330 335 

He Glu Glu Val Ala Leu Ser Ser. Thr Gly Glu He Pro Phe Tyr Gly 
340 345 350 

Lys Ala He Pro He Glu Thr He Lys Gly Gly Arg His Leu He Phe 
355 360 365 

Cys His Ser Lys Lys Lys Cys Asp Glu' Leu Ala Ala Lys Leu Ser Gly 
370 375 380 

Leu Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp val Ser Val 
385 390 395 400 

He Pro Thr Ser Gly Asp Val He Val Val Ala Thr Asp Ala Leu Met 
405 410 415 

Thr Gly Phe Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn Thr Cys 
420 425 430 

Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Glu 
435 440 445 



Thr Thr Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly 
450 455 460 
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Arg Thr 61y Arg 61y Arg Met Gly lie Tyr Arg Phe Val Thr Pro Gly 
465 470 475 480 

Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr 
485 490 495 

Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Ser Val 
500 505 510 

Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 
515 520 525 

His Leu Glu Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His He Asp 
530 535 540 

Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro Tyr 
545 550 555 560 

Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pro 
565 570 575 

Pro Ser Trp Asp Gin Met Trp Lys Cys Leu lie Arg Leu Lys Pro Thr 
580 585 590 

Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin Asn 
595 600 605 

Glu Val Thr Thr Thr His Pro lie Thr Lys Tyr lie Met Ala Cys Met 
610 615 . 620 

Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly 
625 630 635 640 

Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val Val 
645 650 655 

He Val Gly Arg He He Leu Ser Gly Lys Pro Ala He He Pro Asp 
660 665 670 

Arg Glu Val Leu Tyr Arg Glu Phe Asp Glu Met Glu Glu Cys Ala Ser 
675 680 685 

His Leu Pro Tyr He Glu Gin Gly Met Gin Leu Ala Glu Gin Phe Lys 
690 695 700 

Gin Lys Ala He Gly Leu Leu Gin Thr Ala Thr Lys Gin Ala Glu Ala 
705 710 715 720 

Ala Ala Pro Val Val Glu Ser Lys Trp Arg Thr Leu Glu Ala Phe Trp 
725 730 735 

Ala Lys His Met Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala Gly 
740 745 750 

Leu Ser Thr Leu Pro Gly Asn Pro Ala He Ala Ser Leu Met Ala Phe 
755 760 7^65 

Thr Ala Ser He Thr Ser Pro Leu Thr Thr Gin His Thr Leu Leu Phe 
770 775 780 

Asn He Leu Gly Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser Ala 
785 790 795 800 



Ala Ser Ala Phe Val Gly Ala Gly He Ala Gly Ala Ala Val Gly Ser 
805 810 815 
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He Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala 
820 825 830 

Gly Val Ala Gly Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu Met 
835 840 845 

Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser Pro 
850 855 860 

Glv Ala Leu Val Val Gly Val Val Cys Ala Ala He Leu Arg Arg His 
865 870 875 880 

Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala 
885 890 895 

Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu 
900 905 910 

Ser Asp Ala Ala Ala Arg Val Thr Gin He Leu Ser Ser Leu Thr He 
915 920 925 

Thr Gin Leu Leu Lys Arg Leu His Gin Trp He Asn Glu Asp Cys Ser 
930 935 940 

Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys 
945 950 955 960 

Thr Val Leu Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro 
965 970 975 

Arg Leu Pro Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly 
980 985 990 

Val Trp Arg Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala 
995 1000 1005 

Gin He Thr Gly His Val Lys Asn Gly. Ser Met Arg He Val Gly Pro 
1010 1015 1020 

Ara Thr Cys Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr 
1025 1030 1035 1040 

Thr Thr Gly Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala 
1045 1050 1055 

Leu Trp Arg Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly 
1060 1065 1070 

Asp Phe His Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro 
1075 1080 1085 

Cys Gin Val Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg 
1090 1095 1100 

Leu His Arg Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val 
1105 1110 Ills 1120 

Thr Phe Leu Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro 
1125 1130 1135 

Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp 
1140 1145 1150 

Pro Ser His He Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly 
1155 1160 1165 
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Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro 
1170 ll^S USD 

Ser Leu Lys Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp 
1185 1190 1195 1200 

Leu lie Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn He 
1205 1210 1215 

Thr Arg Val Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe Glu 
1220 1225 1230 

Pro Leu Gin Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu 
1235 1240 1245 

lie Leu Arg Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala 
1250 1255 1260 

Arg Pro Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp 
1265 1270 1275 1280 

Tyr Val Pro Pro Val Val His Gly Cys Pro lieu Pro Pro Ala Lys Ala - 
1285 1290 1295 

Pro Pro He Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu 
1300 1305 1310 

Ser Thr Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly 
1315 1320 1325 

Ser Ser Glu Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro 
1330 1335 1340 

Asp Gin Pro Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr 
1345 1350 1355 1360 

Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser 
1365 1370 1375 

Asp Gly Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val 
1380 1385 1390 

Cys Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu He Thr Pro Cys 
1395 1400 1405 

Ala Ala Glu Glu Thr Lys Leu Pro He Asn Ala Leu Ser Asn Ser Leu 
1410 1415 1420 

Leu Arg His His Asn Leu Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser 
1425 1430 1435 1440 

Leu Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp Asp 
1445 1450 1455 

His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr Val 
1460 1465 1470 

Lys Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro 
1475 1480 1485 

His Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn 
1490 1495 1500 

Leu Ser Ser Lys Ala Val Asn His He Arg Ser Val Trp Lys Asp Leu 
1505 1510 1515 1520 
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Leu Glu Asp Thr Glu Thr Pro He Asp Thr Thr He Met Ala Lys Asn 
1525 1530 1535 

Glu Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg 
1540 1545 1550 

Leu He Val Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala 
1555 1560 1565 

Leu Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser 
1570 1575 1580 

Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn 
1585 1590 1595 1600 

Ala Trp Lys Ala Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr Arg 
.1605 1610 1615 

Cys Phe Asp Ser Thr Val Thr Glu Asn Asp. He Arg Val Glu Glu Ser 
1620 1625 1630 

He Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg 
1635 1640 1645 

Ser Leu Thr Glu Arg Leu Tyr He . Gly Gly Pro Leu Thr Asn Ser Lys 
1650 1655 1660 

Gly Gin Asn Cys Gly Tyr Arg Arg Qys Arg Ala Ser Gly Val Leu Thr 
1665 1670 1675 1680 

Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ala Ala Ala 
1685 1690 1695 

Cys Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp 
1700 1705 1710 

Asp Leu Vai Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Glu Ala 
1715 1720 1725 

Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro 
1730 1735 1740 

Gly Asp Pro Pro Lys Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys 
1745 1750 1755 1760 

Ser Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr 
1765 1770 1775 

Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu 
1780 1785 1790 

Thr Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn lie He Met 
1795 1800 1805 

Tyr Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe 
1810 1815 1820 

Ser He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Gin 
1825 1830 1835 1840 

He Tyr Gly Ala Cys Tyr Ser He Glu Pro Leu Asp Leu Pro Gin He 
1845 1850 1855 

He Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser 
1860 1865 1870 
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Pro Gly Glu lie Asn Arg Val Ala Ser Cys I-eu Arg Lys Leu Gly Val 
1875 1880 1885 

Pro Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Arg 
1890 1895 1900 

Leu Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe 
1905 1910 1915 1920 

Asn Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro He Pro Ala Ala 
1925 1930 1935 

Ser Gin Leu Asp Leu Ser Ser Trp Phe Val Ala Gly Tyr Ser Gly Gly 
1940 1945 1950 

Asp He Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met Trp 
1955 I960 1965 

Cys Leu Leu Leu Leu Ser Val Gly Val Gly He Tyr Leu Leu Pro Asn 
1970 1975 1980 



Arg 
1985 



<210> 4 
<211> 447 
<212> PRT 

<213> Hepatitis C virus 
<400> 4 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys Thr Val Leu 
1 5 .10 15 

Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 
20 25 30 

Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 
35 40 45 

Gly Asp Gly He Met Gin Thr Thr Cys Pro Qys Gly Ala Gin He Thr 
50 55 60 

Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Arg Thr Cys 
. 65 70. 75 . 80 

Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr Thr Thr Gly 
85 90 95 

Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 105 110 

Val Ala Ala Glu Glu Tyr Val Glu Val Thr Tlrg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 

Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro 
180 185 190 
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Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His 
X95 200 205 

lie Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys 
225 230 235 240 

Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu He Glu 
245 250 255 

Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn He Thr Arg Val 
260 265 270 

Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe Glu Pro Leu Gin 
275 280 285 

Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu He Leu Arg 
290 295 300 

Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala Arg Pro Asp 
305 310 315 320 

Tvr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp Tyr Val Pro 
325 330 335 

Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro He 
340 345 350 

Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr Val 
355 360 365 

Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser Glu 
370 375 380 

Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin Pro 
385 390 395 400 

Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met 
405 410 415 

Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser 
420 425 . 430 

Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
435 440 445 



<210> 5 
<211> 7987 
<212> DNA 

<213> Hepatitis C virus 
<400> 5 

gccagccccc gattgggggc gacactccac 
tcttcacgca gaaagcgtct agccatggcg 
cccccctccc gggagagcca tagtggtctg 
gacgaccggg tcctttcttg gatcaacccg 
gcgagactgc tagccgagta gtgttgggtc 
gtgcttgcga gtgccccggg aggtctcgta 
ctcaaagaaa aaccaaaggg cgcgccatga 
cggccgcttg ggtggagagg ctattcggct 
ctgatgccgc cgtgttccgg ctgtcagcgc 
acctgtccgg tgccctgaat gaactgcagg 



catagatcac tcccctgtga ggaactactg 60 
ttagtatgag tgtcgtgcag cctccaggac 120 
cggaaccggt gagtacaccg gaattgccag 180 
ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgaaaggcc ttgtggtact gcctgatagg 300 
gaccgtgcac catgagcacg aatcctaaac 360 
ttgaacaaga tggattgcac gcaggttctc 420 
atgactgggc acaacagaca atcggctgct 480 
aggggcgccc ggttcttttt gtcaagaccg 540 
acgaggcagc gcggctatcg tggctggcca 600 
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cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct Htgaaaaaca cgataatacc 1800 
atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 
agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 
caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2100 
ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 2160 
ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 
ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 
acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340 
cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400 
gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460 
gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 
gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580 
accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 
tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 
atcctgggca tcggcacagt cctggaccaa goggagacgg ctggagcgcg actcgtcgtg 2760 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 
* aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 
ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 
ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 
caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 
gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 
tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 
gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4140 
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgUggcc 4320 
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4440 
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 
ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 
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cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4620 
gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 4680 
acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740 
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 
atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920 
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 
acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 
ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280 
gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctagccag 5340 
ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400 
ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 5460 
tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520 
agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 5580 
cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640 
tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 5700 
cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 5760 
gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 5820 
acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 5880 
tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 5940 
tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6000 
acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060 
agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcggcaga agaaggtcac ctttgacaga ctgcaggtcc tggacgacca ctaccgggac 6180 
gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6240 
gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300 
gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 6360 
ctggaagaca ctgagacacc aattgacacc accatcatgg caaaa€iatga ggttfctctgc 6420 
gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 
gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 6540 
atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 6600 
gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 6660 
acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720 
cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780 
actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 6840 
accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 6900 
aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960 
gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7020 
tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7080 
tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140 
gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 
tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7260 
actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320 
atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380 
catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440 
tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500 
gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560 
aactgggcag taaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 7620 
ttatccagct ggttcgttgc tggttacagc gggggagaca tatatcacag cctgtctcgt 7680 
gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 7740 
ctactcccca accgatgaac ggggagctaa acactccagg ccaataggcc atcctgtttt 7800 
tttccctttt tttttttctt tttttttttt tttttttttt tttttttttt ctcctttttt 7860 
tttcctcttt ttttcctttt ctttcctttg gtggctccat cttagcccta gtcacggcta 7920 
gctgtgaaag gtccgtgagc cgcttgactg cagagagtgc tgatactggc ctctctgcag 7980 
atcaagt 7987 

<210> 6 
<211> 7989 
<212> DNA 

<213> Hepatitis C virus 
<400> 6 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
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tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 
atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 
agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 
caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2100 
ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggi: gcgccggcgg 2160 
ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 
ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 
acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340 
cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400 
gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460 
gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 
gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580 
accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 
tctgggggcg cctatgacat cataatatgt gatgagtgcc acrtcaactga ctcgaccact 2700 
^ atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 
aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 
ctggacccga* ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 
ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 
caaatgtgga agtcftctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 
gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 
gatgagatgg aagagtgcgc ctcacacctc ccttiacatcg aacagggaat gcagctcgcc 3900 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 
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tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 
gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4140 
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320 
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4440 
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 
ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 
cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4620 
gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 4680 
acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740 
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 
atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920 
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 
acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 
ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280 
gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctagccag 5340 
ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400 
ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 5460 
tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520 
agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 5580 
cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640 
tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 5700 
cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 5760 
gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 5820 
acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 5880 
tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 5940 
tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6000 
acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060 
agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcggcaga agaaggtcac ctttgacaga ctgcaggtcc tggacgacca ctaccgggac 6180 
gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6240 
gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300 
gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 6360 
ctggaagaca ctgagacacc aattgacacc accatcatgg caaaaaatga ggttttctgc 6420 
gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 
gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 6540 
atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 6600 
gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 6660 
acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720 
cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780 
actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 6840 
accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 6900 
aagctcgagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960 
gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7020 
•tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7080 
tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140 
gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 
tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7260 
actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320 
atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380 
catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440 
tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500 
gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560 
aactgggcag taaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 7620 
ttatccagct ggttcgttgc tggttacagc gggggagaca tatatcacag cctgtctcgt 7680 
gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 7740 
ctactcccca accgatgaac ggggacctaa acactccagg ccaataggcc atcctgtttt 7800 
tttccctttt tttttttctt tttttttttt tttttttttt tttttttttt ttctcctttt 7860 
tttttcctct ttttttcctt ttctttcctt tggtggctcc atcttagccc tagtcacggc 7920 
tagctgtgaa aggtccgtga gccgcttgac tgcagagagt gctgatactg gcctctctgc 7980 
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agatcaagt 7989 

<210> 7 
<211> 7848 
<212> DNA 

<213> Hepatitis C virus 
<400> 7 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagogaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 
atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 
agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 
caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2100 
ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 2160 
ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 
ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 
acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340 
cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400 
gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460 
gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 
gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc,2580 
accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 
tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 
atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 
aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 
ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 
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ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 
caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 
gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 
tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 
gcgatagcat cactgatggc attcacagcc tctat caeca gcccgctcac cacccaacat 4140 
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320 
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4440 
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 
ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 
cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4620 
gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 4680 
acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740 
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 
atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920 
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 
acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 
ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280 
gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctagccag 5340 
ctgtactctt tcgagccgct ccaagcggag gaggatgaga gggaagtatc cgttccggcg 5400 
gagatcctgc ggaggtccag gaaattccct cgagcgatgc ccatatgggc acgcccggat 5460 
tacaaccctc cactgttaga gtcctggaag gacccggact acgtccctcc agtggtacac 5520 
gggtgtccat tgccgcctgc caaggcccct ccgataccac ctccacggag gaagaggacg 5580 
gttgtcctgt cagaatctac cgtgtcttct gccttggcgg agctcgccac aaagaccttc 5640 
ggcagctccg aatcgtcggc cgtcgacagc ggcacggcaa cggcctctcc tgaccagccc 5700 
tccgacgacg gcgacgcggg atccgacgtt gagtcgtact cctccatgcc cccccttgag 5760 
ggggagccgg gggatcccga tctcagcgac gggtcttggt ctaccgtaag cgaggaggct 5820 
agtgaggacg tcgtctgctg ctcgatgtcc tacacatgga caggcgccct gatcacgcca 5880 
"tgcgctgcgg aggaaaccaa gctgcccatc aat:gcact.ga gcaactctt't gctccgtcac 5940 
cacaacttgg tctatgctac aacatctcgc agcgcaagcc tgcggcagaa gaaggtcacc 6000 
tttgacagac tgcaggtcct ggacgaccac taccgggacg tgctcaagga gatgaaggcg 6060 
aaggcgtcca cagttaaggc taaacttcta tccgtggagg aagcctgtaa gctgacgccc 6120 
ccacattcgg ccagatctaa atttggctat ggggcaaagg acgtccggaa cctatccagc 6180 
aaggccgtta accacatccg ctccgtgtgg aaggacttgc tggaagacac tgagacacca 6240 
atitgacacca ccatcatggc aaaaaatgag gttttctgcg tccaaccaga gaaggggggc 6300 
cgcaagccag ctcgccttat cgtattccca gatttggggg ttcgtgtgtg cgagaaaatg 6360 
gccctttacg atgtggtctc caccctccct caggccgtga tgggctcttc atacggattc 6420 
caatactctc ctggacagcg ggtcgagttc ctggtgaatg cctggaaagc gaagaaatgc 6480 
cc^atgggct tcgcatatga cacccgctgt tttgactcaa cggtcactga gaatgacatc 6540 
cgtgttgagg agtcaatcta ccaatgttgt gacttggccc ccgaagccag acaggccata 6600 
aggtcgctca cagagcggct ttacatcggg ggccccctga ctaattctaa agggcagaac 6660 
tgcggctatc gccggtgccg cgcgagcggt gtactgacga ccagctgcgg taataccctc 672o 
acatgttact tgaaggccgc tgcggcctgt cgagctgcga agctccagga ctgcacgatg 6780 
ctcgtatgcg gagacgacct tgtcgttatc tgtgaaagcg cggggaccca agaggacgag 6840 
gcgagcctac gggccttcac ggaggctatg actagatact ctgccccccc tggggacccg 6900 
cccaaaccag aatacgactt ggagttgata acatcatgct cctccaatgt gtcagtcgcg 6960 
cacgatgcat ctggcaaaag ggtgtactat ctcacccgtg accccaccac cccccttgcg 7020 
cgggctgcgt gggagacagc tagacacact ccagtcaatt cctggctagg caacatcatc 7080 
atgtatgcgc ccaccttgtg ggcaaggatg atcctgatga ctcatttctt ctccatcctt 7140 
ctagctcagg aacaacttga aaaagcccta gattgtcaga tctacggggc ctgttactcc 7200 
attgagccac ttgacctacc tcagatcatt caacgactcc atggccttag cgcattttca 7260 
ctccatagtt actctccagg tgagatcaat agggtggctt catgcctcag gaaacttggg 7320 
gtaccgccct tgcgagtctg gagacatcgg gccagaagtg tccgcgctag gctactgtcc 7380 
caggggggga gggctgccac ttgtggcaag tacctcttca actgggcagt aaggaccaag 7440 
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ctcaaactca ctccaatccc ggctgcgtcc cagttggatt tatccagctg gttcgttgct 7500 
ggttacagcg ggggagacat atatcacagc ctgtctcgtg cccgaccccg ctggttcatg 7560 
tggtgcctac tcctactttc tgtaggggta ggcatctatc tactccccaa ccgatgaacg 7620 
gggacctaaa cactccaggc caataggcca tcctgttttt ttcccttttt ttttttcttt 7680 
tttttttttt tttttttttt tttttttttt tctccttttt ttttcctctt tttttccttt 7740 
tctttccttt ggtggctcca tcttagccct agtcacggct agctgtgaaa ggtccgtgag 7800 
ccgcttgact gcagagagtg ctgatactgg cctctctgca gatcaagt 7848 

<210> 8 
<211> 7987 
<212> DNA 

<213> Hepatitis C virus 
<400> 8 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
.cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg. gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 
. atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 
agcctcacag gccgggacag gaaccaggHc gagggggagg tccaagtggt ctccaccgca 1920 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 
caggacctcg tcggctggcg agcgcccccc ggg^cgcgtt ccttgacacc atgcacctgc 2100 
ggcagctcgg acctt-Hactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 2160 
ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 
ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 
acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340 
cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400 
gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460 
gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 
gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580 
accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 
tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 
atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 
aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 
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ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggrtttgt gactccagga 3240 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 
ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 
caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 
gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 
tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 
gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4140 
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320 
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4440 
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 
ggtaaccacg tatcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 
cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4620 
gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc .4680 
acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740 
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 
atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920 
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 
acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 
ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280 
gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctatccag 5340 
ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400 
ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 5460 
tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520 
agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 5580 
cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640 
tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 5700 
cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 5760 
gagctcgcca caaagacctt cggcagctcc gaatcgt^jgg ccgtcgacag cggcacggca 5820 
acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 5880 
tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 5940 
tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6000 
acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060 
agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcggcaga agaaggtcac ctttgacraga ctgcaggtcc tggacgacca ctaccgggac 6180 
gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6240 
gaagcctgta agctigacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300 
gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 6360 
ctggaagaca ctgagacacc aattgacacc accatcatgg caaaaaatga ggttttctgc 6420 
gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 
gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 6540 
atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctiggtgaat 6600 
gcctggaaag cgaagaaatg ccctiatgggc ttcgcatatg acacccgclig ttttgactca 6660 
acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720 
cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780 
actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 6840 
accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 6900 
aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960 
gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gact.agatac 7020 
tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7080 
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tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140 
gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 
tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7260 
actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320 
atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380 
catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440 
tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500 
gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560 
aactgggcag taaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 7620 
ttatccagct ggttcgttgc tggttacagc gggggagaca tatatcacag cctgtctcgt 7680 
gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 7740 
ctactcccca accgatgaac ggggagctaa acactccagg ccaataggcc atcctgtttt 7800 
tttccctttt tttttttctt tttttttttt tttttttttt tttttttttt ctcctttttt 7860 
tttcctcttt ttttcctttt ctttcctttg gtggctccat cttagcccta gtcacggcta 7920 
gctgtgaaag gtccgtgagc cgcttgactg cagagacrtgc tgatactggc ctctctgcag 7980 
atcaagt 7987 

<210> 9 
<211> 7989 
<212> DNA 

<213> Hepatitis C virus 
<400> 9 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 
atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 
agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 
caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2100 
ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 2160 
ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 
ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 
acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340 
cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400 
gcccatctac acgcccctac tggtagcggc aagagcac-ta aggtgccggc tgcgtatgca 2460 
gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 
gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580 
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accacgggt:g cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 
tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 
atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 28B0 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 
aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 
ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 
ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 
caaatgtggg agtgtctcat acggctaaag cctacgctgc acgggccaac.gcccctgctg 3600 
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 
gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccrttctgggc gaagcatatg 4020 
tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 
gcgatagcat cactgatggc attcacagcc tctatcacca gcccgctcac cacccaacat 4140 
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320 
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4440 
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 
ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 
cagatcctct ctggtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4620 
gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 4680 
acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740 
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 
atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920 
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 
acattcctgg tegggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 
ccggacgtag c^gtgctcac ttccatgctc accgacccct cccacattac ggcggagacg ^80 
gctaagcgtg ggctggccag gggatctccc ccctccttgg ccagctcatc agctagccag 5340 
ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400 
ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 5460 
tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520 
agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg. 5580 
cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640 
tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 5700 
cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 5760 
gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 5820 
acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 5880 
tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 5940 
tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6000 
acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060 
agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcggcaga agaaggtcac ctttgacaga ctgcaggtcc tggacgacca ctaccgggac 61 BO 
gtgctcaagg agatgaaggc gaaggcgtcc acagtitaagg ctaaacttct aticcgtggag 6240 
gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300 
gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 6360 
ctggaagaca ctgagacacc aa-ttgacacc accat.catgg caaaaaatga gg'tt'tt.ctigc 6420 
gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 
gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 6540 
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atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 6600 
gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 6660 
acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720 
cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780 
actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 6840 
accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 6900 
aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960 
gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7020 
tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7080 
tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140 
gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 
tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7260 
actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320 
atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380 
catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440 
tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500 
gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560 
aactgggcag taaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 7620 
ttatccagct ggttcgttgc tggttacagc gggggagaca tat.atcacag cctgtctcgt 7680 
gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 7740 
ctactcccca accgatgaac ggggacctaa acactccagg ccaataggcc atcctgtttt 7800 
tttfccctttt tttttttctt tttttttttt tttttttttt tttttttttt ttctcctttt 7860 
tttttcctct ttttttcctt ttctttcctt tggtggctcc atcttagccc tagtcacggc 7920 
tagctgtgaa aggtccgtga gccgcttgac tgcagagagt gctgatactg gcctctctgc 7980 
agatcaagt 7989 

<210> 10 
<211> 7989 
<212> DNA 

<213> Hepatitis C virus 
<400> 10 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
^agtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccgi gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcci 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
. gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 
atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 
agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 
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caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2X00 
ggcagctcgg acctttactt ggtcaogagg catgccgatg tcattccggt gcgccggcgg 2160 
ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 
ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 
acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340 
cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400 
gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460 
gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 
gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580 
accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 
tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 
atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 
aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 
at'accaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 
ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 
acccacatag acgcccattt cttgtcccag actaagcagg caggagacaa cttcccctac 3480 
ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 
caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 
gtcctagcag ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 
tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 
gcgatagcat cactgatggc attcacagcc tctat caeca gcccgctcac cacccaacat 4140 
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320 
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4440 
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 
ggtaaccacg -tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 
cagatcctct ctagtcttac catcactcag ctgctgaaga ggcttcacca gtggatcaac 4620 
gaggactgct ccacgccatg ctccggctcg tggctaagag atgtttggga ttggatatgc 4680 
acggtgttga ctgatttcaa gacctggctc cagtccajigc tcctgccgcg attgccggga 4740 
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 
atcgtggggc ctaggacdtg tagtaacacg tggcatggaa cattccccat taacgcgtac 4920 
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5X60 
acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 
ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280 
gctaagcgta ggctggccag gggatctccc ccctccttgt ccagctcatc agctagccag 5340 
ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400 
cbcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 5460 
tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520 
agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 5580 
cccatatiggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640 
tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca 5700 
cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 5760 
gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 5820 
acggcctctic ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgt-ac 5880 
tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 5940 
tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6000 
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acaggcgccc tgatcaogcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060 
agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcggcaga agaaggtcac ctt.tgacaga ctgcaggtcc tggacgacca ctaccgggac 6180 
gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6240 
gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300 
gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 6360 
ctggaagaca ctgagacacc aattgacacc accatcatgg caaaaaatga ggttttctgc 6420 
gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 
gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggccgtg 6540 
atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 6600 
gcctggaaag cgaagaaatg ccctatgggc ttcgcatatg acacccgctg ttttgactca 6660 
acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720 
cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780 
actaattgta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 6840 
accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 6900 
aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960 
gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7020 
tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7080 
tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140 
gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 
licctggctag gcaacatcat catgta^gcg cccaccttgt gggcaaggat gatcctgatg 7260 
actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320 
atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380 
catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440 
tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500 
gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560 
aactgggcag t.aaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 7620 
ttatccagct ggttcgttgc tggttacagc gggggagaca tatatcacag cctgtctcgt 7680 
gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 7740 
ctactcccca accgatgaac ggggacctaa acactccagg ccaataggcc atcctgtttt 7800 
tttccctttt tttttttctt tttttttttt tttttttttt tttttttttt ttctcctttt 7860 
ttttt octet ttttttcctt ttctttcctt tggtggctcc atcttagccc tagtcacggc 7920 
tagctgtgaa aggtccgtga gccgcttgac tgcagagagt gctgatactg gcctctctgc 7980 
agatcaagt 7989 

<210> 11 
<211> 1341 
<212> DNA 

<213> Hepatitis C virus 
<400> 11 

tccggctcgt ggctaagaga tgtttgggat tggatatgca cggtgttgac tgatttcaag 60 
acctggctcc agtccaagct cctgccgcga ttgccgggag tccccttctt ctcatgtcaa 120 
cgtgggtaca agggagtctg gcggggcgac ggcatcatgc aaaccacctg cccatgtgga 180 
gcacagatca ccggacatgt gaaaaacggt tccatgagga tcgtggggcc taggacctgt 240 
agtaacacgt ggcatggaac attccccatt aacgcgtaca ccacgggccc ctgcacgccc 300 
tccccggcgc caaattattc tagggcgctg tggcgggtgg ctgctgagga gtacgtggag 360 
gttacgcggg tgggggattt ccactacgtg acgggcatga ccactgacaa cgtaaagtgc 420 
ccgtgtcagg ttccggcccc cgaattcttc acagaagtgg atggggtgcg gttgcacagg 480 
tacgctccag cgtgcaaacc cctcctacgg gaggaggtca cattcctggt cgggctcaat 540 
caatacctgg ttgggtcaca gctcccatgc gagcccgaac cggacgtagc agtgctcact 600 
tccatgctca ccgacccctc ccacattacg gcggagacgg ctaagcgtag gctggccagg 660 
ggatctcccc cctgcttggc ciagctcatca gctagccagc tgtctgcgcc ttccttgaag 720 
gcaacatgca ctacccgtca tgactccccg gacgctgacc tcatcgaggc caacctcctg 780 
tggcggcagg agatgggcgg gaacatcacc cgcgtggagt cagaaaataa ggtagtaatt 840 
ttggactctt tcgagccgct ccaagcggag gaggatgaga gggaagtatc cgttccggcg 900 
gagatcctgc ggaggtccag gaaattccct cgagcgatgc ccatatgggc acgcccggat 960 
tacaaccctc cactgttaga gtcctggaag gacccggact acgtccctcc agtggtacac 1020 
gggtgtccat tgccgcctgc caaggcccct ccgataccac ctccacggag gaagaggacg 1080 
gttgtcctgt cagaatctac cgtgtcttct gccttggcgg agctcgccac aaagaccttc 1140 
ggcagctccg aatcgtcggc cgtcgacagc ggcacggcaa cggcctctcc tgaccagccc 1200 
tccgacgacg gcgacgcggg atccgacgtt gagtcgtact cctccatgcc cccccttgag 1260 
ggggagccgg gggatcccga tctcagcgac gggtcttggt ctaccgtaag cgaggaggct 1320 
agtgaggacg tcgtctgctg c 1341 

<210> 12 
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<211> 1341 
<212> DNA 

<213> Hepatitis C virus 
<400> 12 

tccggctcgt ggctaagaga tgtttgggat tggatatgca cggtgttgac tgatttcaag 60 
acctggctcc agtccaagct cctgccgcga ttgccgggag tccccttctt ctcatgtcaa 120 
cgtgggtaca agggagtctg gcggggcgac ggcatcatgc aaaccacctg cccatgtgga 180 
gcacagatca ccggacatgt gaaaaacggt tccatgagga tcgtggggcc taggacctgt 240 
agtaacacgt ggcatggaac attccccatt aacgcgtaca ccacgggccc ctgcacgccc 300 
tccccggcgc caaattattc tagggcgctg tggcgggtgg ctgctgagga gtacgtggag 360 
gttacgcggg tgggggattt ccactacgtg acgggcatga ccactgacaa cgtaaagtgc 420 
ccgtgtcagg ttccggcccc cgaattcttc acagaagtgg atggggtgcg gttgcacagg 480 
tacgctccag cgtgcaaacc cctcctacgg gaggaggtca cattcctggt cgggctcaat 540 
caatacctgg ttgggtcaca gctcccatgc gagcccgaac cggacgtagc agtgctcact 600 
tccatgctca ccgacccctc ccacattacg gcggagacgg ctaagcgtag gctggccagg 660 
ggatctcccc cccccttggc cagctcatca gctagccagc tgtctgcgcc ttccttgaag 720 
gcaacatgca ctacccgtca tgactccccg gacgctgacc tcatcgaggc caacctcctg 780 
tggcggcagg agatgggcgg gaacatcacc cgcgtggagt cagaaaataa ggtagtaatt 840 
ttggactctt tcgagccgct ccaagcggag gaggatgaga gggaagtatc cgttccggcg 900 
gagatcctgc ggaggtccag gaaattccct cgagcgatgc ccatatgggc acgcccggat 960 
tacaaccctc cactgttaga gtcctggaag gacccggact acgtccctcc agtggtacac 1020 
gggtgtccat tgccgcctgc caaggcccct ccgataccac ctccacggag gaagaggacg 1080 
gttgtcctgt cagaatctac cgtgtcttct gccttggcgg agctcgccac aaagaccttc 1140 
ggcagctccg aatcgtcggc cgtcgacagc ggcacggcaa cggcctctcc tgaccagccc 1200 
tccgacgacg gcgacgcggg atccgacgtt gagtcgtact cctccatgcc cccccttgag 1260 
ggggagccgg gggatcccga tctcagcgac gggtcttggt ctaccgtaag cgaggaggct 1320 
agtgaggacg tcgtctgctg c 1341 



<210> 13 
<211> 7987 
<212> DNA 

<213> Hepatitis C virus 



<400> 13 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggalictg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataatacc 1800 
atggcgccta ttacggccta ctcccaacag acgcgaggcc tacttggctg catcatcact 1860 
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agcctcacag gccgggacag gaaccaggtc gagggggagg tccaagtggt ctccaccgca 1920 
acacaatctt tcctggcgac ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc 1980 
ggctcaaaga cccttgccgg cccaaagggc ccaatcaccc aaatgtacac caatgtggac 2040 
caggacctcg tcggctggca agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc 2100 
ggcagctcgg acctttactt ggtcacgagg catgccgatg tcattccggt gcgccggcgg 2160 
ggcgacagca gggggagcct actctccccc aggcccgtct cctacttgaa gggctcttcg 2220 
ggcggtccac tgctctgccc ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc 2280 
acccgagggg ttgcgaaggc ggtggacttt gtacccgtcg agtctatgga aaccactatg 2340 
cggtccccgg tcttcacgga caactcgtcc cctccggccg taccgcagac attccaggtg 2400 
gcccatctac acgcccctac tggtagcggc aagagcacta aggtgccggc tgcgtatgca 2460 
gcccaagggt ataaggtgct tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg 2520 
gcgtatatgt ctaaggcaca tggtatcgac cctaacatca gaaccggggt aaggaccatc 2580 
accacgggtg cccccatcac gtactccacc tatggcaagt ttcttgccga cggtggttgc 2640 
tctgggggcg cctatgacat cataatatgt gatgagtgcc actcaactga ctcgaccact 2700 
atcctgggca tcggcacagt cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg 2760 
ctcgccaccg ctacgcctcc gggatcggtc accgtgccac atccaaacat cgaggaggtg 2820 
gctctgtcca gcactggaga aatccccttt tatggcaaag ccatccccat cgagaccatc 2880 
aaggggggga ggcacctcat tttctgccat tccaagaaga aatgtgatga gctcgccgcg 2940 
aagctgtccg gcctcggact caatgctgta gcatattacc ggggccttga tgtatccgtc 3000 
ataccaacta gcggagacgt cattgtcgta gcaacggacg ctctaatgac gggctttacc 3060 
ggcgatttcg actcagtgat cgactgcaat acatgtgtca cccagacagt cgacttcagc 3120 
ctggacccga ccttcaccat tgagacgacg accgtgccac aagacgcggt gtcacgctcg 3180 
cagcggcgag gcaggactgg taggggcagg atgggcattt acaggtttgt gactccagga 3240 
gaacggccct cgggcatgtt cgattcctcg gttctgtgcg agtgctatga cgcgggctgt 3300 
gcttggtacg agctcacgcc cgccgagacc tcagttaggt tgcgggctta cctaaacaca 3360 
ccagggttgc ccgtctgcca ggaccatctg gagttctggg agagcgtctt tacaggcctc 3420 
acccacatag acgcccattt cttgtcccag acrtaagcagg caggagacaa cttcccctac 3480 
ctggtagcat accaggctac ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac 3540 
caaatgtgga agtgtctcat acggctaaag cctacgctgc acgggccaac gcccctgctg 3600 
tataggctgg gagccgttca aaacgaggtt actaccacac accccataac caaatacatc 3660 
atggcatgca tgtcggctga cctggaggtc gtcacgagca cctgggtgct ggtaggcgga 3720 
gtcctagcag. ctctggccgc gtattgcctg acaacaggca gcgtggtcat tgtgggcagg 3780 
atcatcttgt ccggaaagcc ggccatcatt cccgacaggg aagtccttta ccgggagttc 3840 
gatgagatgg aagagtgcgc ctcacacctc ccttacatcg aacagggaat gcagctcgcc 3900 
gaacaattca aacagaaggc aatcgggttg ctgcaaacag ccaccaagca agcggaggct 3960 
gctgctcccg tggtggaatc caagtggcgg accctcgaag ccttctgggc gaagcatatg 4020 
tggaatttca tcagcgggat acaatattta gcaggcttgt ccactctgcc tggcaacccc 4080 
gcgatagcat cactgatggc attcacagcc tctat caeca gcccgctcac cacccaacat 4140 
accctcctgt ttaacatcct ggggggatgg gtggccgccc aacttgctcc tcccagcgct 4200 
gcttctgctt tcgtaggcgc cggcatcgct ggagcggctg ttggcagcat aggccttggg 4260 
aaggtgcttg tggatatttt ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc 4320 
tttaaggtca tgagcggcga gatgccctcc accgaggacc tggttaacct actccctgct 4380 
atcctctccc ctggcgccct agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac 4440 
gtgggcccag gggagggggc tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg 4500 
ggtaaccacg tctcccccac gcactatgtg cctgagagcg acgctgcagc acgtgtcact 4560 
cagatcctct ctagtcttac catcactcag ctgctgaaga ggctt caeca gtggatcaac 4620 
gaggactgct ccacgccatg ptccggctcg tggctaagag atgtttggga ttggatatgc 4680 
acggtgttga ctgatttcaa gacctggctc cagtccaagc tcctgccgcg attgccggga 4740 
gtccccttct tctcatgtca acgtgggtac aagggagtct ggcggggcga cggcatcatg 4800 
caaaccacct gcccatgtgg agcacagatc accggacatg tgaaaaacgg ttccatgagg 4860 
atcgtggggc ctaggacctg tagtaacacg tggcatggaa cattccccat taaegcgtac 4920 
accacgggcc cctgcacgcc ctccccggcg ccaaattatt ctagggcgct gtggcgggtg 4980 
gctgctgagg agtacgtgga ggttacgcgg gtgggggatt tccactacgt gacgggcatg 5040 
accactgaca acgtaaagtg cccgtgtcag gttccggccc ccgaattctt cacagaagtg 5100 
gatggggtgc ggttgcacag gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc 5160 
acattcctgg tcgggctcaa tcaatacctg gttgggtcac agctcccatg cgagcccgaa 5220 
ccggacgtag cagtgctcac ttccatgctc accgacccct cccacattac ggcggagacg 5280 
gctaagcgta ggctggccag gggatctccc ccctccttgg ccagctcatc agctatccag 5340 
ctgtctgcgc cttccttgaa ggcaacatgc actacccgtc atgactcccc ggacgctgac 5400 
ctcatcgagg ccaacctcct gtggcggcag gagatgggcg ggaacatcac ccgcgtggag 5460 
tcagaaaata aggtagtaat tttggactct ttcgagccgc tccaagcgga ggaggatgag 5520 
agggaagtat ccgttccggc ggagatcctg cggaggtcca ggaaattccc tcgagcgatg 5580 
cccatatggg cacgcccgga ttacaaccct ccactgttag agtcctggaa ggacccggac 5640 
tacgtccctc cagtggtaca cgggtgtcca ttgccgcctg ccaaggcccc tccgatiacca 5700 
cctccacgga ggaagaggac ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg 5760 
gagctcgcca caaagacctt cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca 5820 
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acggcctctc ctgaccagcc ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac 5880 
tcctccatgc ccccccttga gggggagccg ggggatcccg atctcagcga cgggtcttgg 5940 
tctaccgtaa gcgaggaggc tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg 6000 
acaggcgccc tgatcacgcc atgcgctgcg gaggaaacca agctgcccat caatgcactg 6060 
agcaactctt tgctccgtca ccacaacttg gtctatgcta caacatctcg cagcgcaagc 6120 
ctgcggcaga agaaggtcac ctttgacaga ctgcaggtcc tggacgacca ctaccgggac 6180 
gtgctcaagg agatgaaggc gaaggcgtcc acagttaagg ctaaacttct atccgtggag 6240 
gaagcctgta agctgacgcc cccacattcg gccagatcta aatttggcta tggggcaaag 6300 
gacgtccgga acctatccag caaggccgtt aaccacatcc gctccgtgtg gaaggacttg 6360 
ctggaagaca ctgagacacc aattgacacc accatcatgg caaaaaatga ggttttctgc 6420 
gtccaaccag agaagggggg ccgcaagcca gctcgcctta tcgtattccc agatttgggg 6480 
gttcgtgtgt gcgagaaaat ggccctttac gatgtggtct ccaccctccc tcaggcegtg 6540 
atgggctctt catacggatt ccaatactct cctggacagc gggtcgagtt cctggtgaat 6600 
gcctggaaag cgaagaaatg ccctatgg|gc ttcgcatatg acacccgctg ttttgactca 6660 
acggtcactg agaatgacat ccgtgttgag gagtcaatct accaatgttg tgacttggcc 6720 
cccgaagcca gacaggccat aaggtcgctc acagagcggc tttacatcgg gggccccctg 6780 
actaattcta aagggcagaa ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg 6840 
accagctgcg gtaataccct cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg 6900 
aagctccagg actgcacgat gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc 6960 
gcggggaccc aagaggacga ggcgagccta cgggccttca cggaggctat gactagatac 7020 
tctgcccccc ctggggaccc gcccaaacca gaatacgact tggagttgat aacatcatgc 7080 
tcctccaatg tgtcagtcgc gcacgatgca tctggcaaaa gggtgtacta tctcacccgt 7140 
gaccccacca ccccccttgc gcgggctgcg tgggagacag ctagacacac tccagtcaat 7200 
tcctggctag gcaacatcat catgtatgcg cccaccttgt gggcaaggat gatcctgatg 7260 
actcatttct tctccatcct tctagctcag gaacaacttg aaaaagccct agattgtcag 7320 
atctacgggg cctgttactc cattgagcca cttgacctac ctcagatcat tcaacgactc 7380 
catggcctta gcgcattttc actccatagt tactctccag gtgagatcaa tagggtggct 7440 
tcatgcctca ggaaacttgg ggtaccgccc ttgcgagtct ggagacatcg ggccagaagt 7500 
gtccgcgcta ggctactgtc ccaggggggg agggctgcca cttgtggcaa gtacctcttc 7560 
aactgggcag taaggaccaa gctcaaactc actccaatcc cggctgcgtc ccagttggat 7620 
ttatccagct ggttcgttgc tggttacagc gggggagaca tatatcacag cctgtctcgt 7680 
gcccgacccc gctggttcat gtggtgccta ctcctacttt ctgtaggggt aggcatctat 7740 
ctactcccca accgatgaac ggggagctaa acactccagg ccaataggcc atcctgtttt 7800 
tttccctttt tttttttctt tttttttttt tttttttttt tttttttttt ctcctttttt 7860 
tttcctcttt ttttcctttt ctttcctttg gtggctccat cttagcccta gtcacggcta 7920 
gctgtgaaag gtccgtgagc cgcttgactg cagagagtgc tgatactggc ctctctgcag 7980 
atcaagt 7987 

<210> 14 
<211> 400 
<212> PRT 

<213> Hepatitis C virus 
<400> 14 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys Thx Val Leu 
1 5 10 ' 15 



Tbr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 
20 25 30 

Gly Val Pro Phe Phe Ser cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 

35 40 . . 45 

Gly Asp Gly He Met Gla Thr Thr Cys Pro Cys Gly Ala Gin He Thr 
50 55 . 60 

Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Arg Thr Cys 
65 * 70 75 80 

S^r Asn Thr Trp Bis Gly Thr Phe Pro He Asn Ala Tyr Thr Thi: Gly 
85 90 95 

Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 105 110 
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VaX Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 

Val Gly lieu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro 
180 185 190 

Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His 
195 200 205 

lie Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Tyr Ser Phe Glu Pro Leu 
225 230 235 240 

Gin Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu lie Leu 
245 250 255 

Arg Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala Arg Pro 
260 265 270 

Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp Tyr Val 
275 280 285 

Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro 
290 295 300 

He Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr 
305 310 315 320 

Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser 
325 330 335 

Glu Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin 
340 345 350 

Pro Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser 
355 360 365 

Met Pro Pro lieu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly 
370 375 380 



Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
385 390 395 400 



<210> 15 
<211> 1985 
<212> PRT 

<213> Hepatitis C virus 
<400> 15 

Met Ala Pro He Thr Ala Tyr Ser Gin Gin Thr Arg Gly Leu Leu Gly 
1 5 10 15 
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Cys lie lie Thr Ser Leu Thr Gly Arg Asp Arg Asn Gin Val Glu Gly 
20 25 30 

Glu Val Gin Val Val Ser Thr Ala Thr Gin Ser Phe Leu Ala Thr Cys 
35 40 45 

Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser Lys Thr 
50 55 60 

Leu Ala Gly Pro Lys Gly Pro He Thr Gin Met Tyr Thr Asn Val Asp 
65 70 75 80 

Gin Asp Leu Val Gly Trp Arg Ala Pro Pro Gly Ala Arg Ser Leu Thr 
85 90 95 

Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala 
100 105 110 

Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu 
115 120 125 

Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro lieu 
130 ' 135 140 

Leu Cys Pro Ser Gly His Ala Val Gly He Phe Arg Ala Ala Val Cys 
145 150 155 160 

Thr Arg* Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu Ser Met 
165 170 175 

Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro Pro 
180 185 190 

Ala Val Pro Gin Thr Phe Gin Val Ala His Leu His Ala Pro Thr Gly 
195 200 205 

Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly Tyr 
210 215 220 

Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 

225 230 235 240 

Ala Tyr Met Ser Lys Ala His Gly He Asp Pro Asn He Arg Thr Gly 
245 250 255 

Val Arg Thr He Thr Thr Gly Ala Pro He Thr Tyr Ser Thr Tyr Gly 
260 265 270 

Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He He 
275 280 285 

He Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr He Leu Gly He 
290 295 300 

Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val Val 
305 310 315 320 

Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro Asn 
325 330 335 



He Glu Glu Val Ala Leu Ser Ser Thr Gly Glu He Pro Phe Tyr Gly 
340 345 350 

Lys Ala He Pro He Glu Thr He Lys Gly Gly Arg His Leu He Phe 
355 360 365 
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Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Ser Gly 
370 375 380 

Leu Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu Asp Val Ser Val 
385 390 395 400 

lie Pro Thr Ser - Gly Asp Val He Val Val Ala Thr Asp laa Leu Met 
405 410 415 

Thr Gly Phe Thr Gly Asp Phe Asp Ser Val He Asp cys Asn Thr Cys 
420 425 430 

Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr He Glu 
435 440 445 

Thr Thr Thr Val Pro. Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly 
450 455 460 

Arg Thr Gly Arg Gly Arg Met Gly He Tyr Arg Phe Val Thr Pro Gly 
465 470 475 480 

Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr 
485 490 495 

Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Ser Val 
500 505 510 

Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 

515. 520 525 

His Leu Glu Phe Trp Glu Ser Val Phe Thr Gly Leu Thr His He Asp 
530 535 540 

Ala His Phe Leu Ser Gin Thr Lys 63Ln Ala Gly Asp Asn Phe Pro Tyr 
545 550 555 560 

LeuyVal Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pro 
565 570 575 

Pro Ser Trp Asp Gin Met Trp Lys Cys Leu He Arg Leu Lys Pro Thr 
580 585 590 

Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin Asn 
595 . 600 605 

Glu Val Thr Thr Thr His Pro He Thr Lys Tyr He Met Ala Cys Met 
610 615 620 

Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly 
625 630 635 640 

Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val Val 
645 650 655 

He Val Gly Arg He He Leu Ser Gly Lys Pro Ala He He Pro Asp 
660 665 670 

Arg Glu Val Leu Tyr Arg Glu Phe Asp Glu Met Glu Glu Cys Ala Ser 
675 680 685 

His Leu Pro Tyr He Glu Gin Gly Met Gin Leu Ala Glu Gin Phe Lys 
690 695 700 

Gin Lys Ala He Gly Leu Leu Gin Thr Ala Thr Lys Gin Ala Glu Ala 
705 710 715 720 
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Ala Ala Pro Val Val Glu Ser Lys Trp Arg Thr Leu Glu Ala Phe Trp 
725 730 735 

Ala Lys His Met Trp Asn Phe lie Ser Gly He Gin Tyr Leu Ala Gly 
740 745 750 

Leu Ser Thr Leu Pro Gly Asn Pro Ala lie Ala Ser Leu Met Ala Phe 

755 760 765 

Thr Ala Ser He Thr Ser Pro Leu Thr Thr Gin His Thr Leu Leu Phe 
770 775 780 

Asn He Leu Gly Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser Ala 
785 790 795 800 

Ala Ser Ala Phe Val Gly Ala Gly He Ala Gly Ala Ala Val Gly Ser 
805 810 815 

He Gly I.eu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala 
820 825 830 

Gly Val Ala Gly Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu Met 
835 840 845 

Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser Pro 
850 855 860 

Gly Ala Leu Val Val Gly Val Val Cys Ala Ala He Leu Arg Arg His 
865 870 875 880 . 

Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala 
685 890 895 

Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu 
900 ' 905 910 

Ser Asp Ala Ala Ala Arg Val Thr Gin He Leu Ser Ser Leu Thr He 
915 920 925 

Thr Gin Leu Leu Lys Arg Leu His Gin Trp He Asn Glu Asp Cys Ser 
930 935 940 

Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys 
945 950 955 960 

Thr Val Leu Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro 
965 970 975 

Arg Leu Pro Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly 
980 985 990 

Val Trp Arg Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala 
995 1000 ' 1005 

Gin He Thr Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro 
1010 1015 1020 

Arg Thr Cys Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr 
1025 1030 1035 1040 

Thr Thr Gly Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala 
1045 1050 1055 

Leu Trp Arg Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly 
1060 1065 1070 
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Asp Phe His Tyr Val Thr Gly Met Thr Thr Asp Aan Val Lys Cys Pro 
1075 1080 1085 

Cys Gin Val Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg 
1090 1095 1100 

Leu His Arg Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val 
1105 1110 1115 1120 

Thr Phe Leu Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro 
1125 1130 1135 

Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp 
1140 1145 1150 

Pro Ser His He Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly 
1155 1160 1165 

Ser Pro Pto Ser Leu Ala Ser Ser Ser Ala He Gin Leu Ser Ala Pro 
1170 1175 1180 

Ser Leu Lys Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp 
1185 1190 1195 1200 

Leu He Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn He 
1205 1210 1215 

Thr Arg Val Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe Glu 
1220 1225 1230 

Pro Leu Gin Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu 
1235 1240 1245 

He Leu Arg Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala 
1250 1255 1260 

Arg Pro Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp 
1265 1270 1275 1280 

Tyr Val Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala 
1285 1290 1295 

Pro Pro He Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu 



Ser Ser Glu Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro 
1330 1335 1340 

Asp Gin Pro Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr 
1345 1350 1355 1360 

Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser 
1365 1370 1375 

Asp Gly Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val 
1380 1385 1390 

Cys Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu He Thr Pro Cys 
1395 1400 1405 

Ala Ala Glu Glu Thr Lys Leu Pro He Asn Ala Leu Ser Asn Ser Leu 
1410 1415 1420 



1300 



1305 



1310 



Ser Thr Val Ser Ser Ala Leu T^a 
1315 1320 



Glu Leu Ala Thr Lys Thr Phe Gly 

1325 
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Leu Arg His His Asn Leu Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser 
1425 1430 1435 1440 

Leu Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp Asp 
1445 1450 1455 

His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr Val 
1460 1465 1470 

Lys Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Ly? Leu Thr Pro Pro 
1475 1480 1485 

His Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn 
1490 1495 1500 

Leu Ser Ser Lys Ala Val Asn His lie Arg Ser Val Trp Lys Asp Leu 
1505 1510 1515 1520 

Leu Glu Asp Thr Glu Thr Pro lie Asp Thr Thr lie Met Ala Lys Asn 
1525 1530 1535 

Glu Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg 
1540 1545 1550 

Leu lie Val Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala 
1555 1560 1565 

Leu Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser 
1570 1575 1580 

Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn 
1585 1590 1595 1600 

Ala Trp Lys Ala Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr Arg 
1605 1610 1615 

Cys Phe Asp Ser Thr Val Thr Glu Asn Asp lie Arg- Val Glu Glu Ser 
1620 1625 1630 

He Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg 
1635 1640 1645 

Ser Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys 
1650 1655 . 1660 

Gly Gin Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr 
1665 1670 1675 1680 

Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ala Ala Ala 
1685 1690 1695 

Cys Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp 
1700 1705 1710 

Asp Leu Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Glu Ala 
1715 1720 1725 

Ser Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro 
1730 1735 1740 

Gly Asp Pro Pro Lys Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys 
1745 1750 1755 1760 

Ser Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr 
1765 1770 1775 
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Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu 
1780 1785 1790 

Thr Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn lie He Met 
1795 1800 1805 

Tyr Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe 
1810 1815 1820 

Ser He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Gin 
1825 1630 1835 1840 

He Tyr Gly Ala Cys Tyr Ser He Glu Pro Leu Asp Leu Pro Gin He 
1845 1850 1655 

He Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser 
1860 1865 1870 

Pro Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val 
1875 1880 1885 

Pro Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Arg 
1890 1895 1900 

Leu Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe 
1905 1910 1915 1920 

Asn Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro He Pro Ala Ala 
1925 1930 1935 

Ser Gin Leu Asp Leu Ser Ser Trp Phe Val Ala Gly Tyr Ser Gly Gly 
1940 1945 1950 

Asp He Tyr His Ser Leu Ser Arg Ala Arg ..Pro Arg Trp Phe Met Trp 
1955 1960 1965 

Cys Leu Leu Leu Leu Ser Val Gly Val Gly He Tyr Leu Leu Pro Asn 
1970 1975 1980 

Arg 
1985 



<210> 16 
<211> 447 
<212> PRT 

<213> Hepatitis C virus 
<400> 16 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys Thr Val Leu 
15 10 15 

Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 

20 25 30 

Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 
35 40 45 

Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala Gin He Thr 
50 55 60 

Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Arg Thr Cys 
65 70 75 80 
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Ser Asn Thr Trp His Gly Thr Phe Pro lie Asn Ala Tyr Thr Thr Gly 
85 90 95 

Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 105 110 

Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 

Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro 
160 185 190 

Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His 
195 200 205 

lie Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Ser Leu Ala Ser Ser Ser Ala He Gin Leu Ser Ala. Pro Ser Leu Lys 
225 230 235 240 

Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu He Glu 
245 250 255 

Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn He Thr Arg Val 
260 265 270 

Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe Glu Pro Leu Gin 
275 280 285 

Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu He Leu Arg 
290 295 300 

Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala Arg Pro Asp 
305 310 315 320 

Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp- Tyr Val Pro 
325 330 335 

Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro He 
340 345 350 

Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr Val 
355 360 365 

Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser Glu 
370 375 380 

Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin Pro 
385 390 395 400 



Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met 
405 410 415 

Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser 
420 425 430 
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Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
435 440 445 



<210> 17 
<211> 1985 
<212> PRT 

<213> Hepatitis C virus 
<400> 17 

Met Ala Pro lie Thr Ala Tyr Ser Gin Gin Thr Arg Gly Leu Leu Gly 
1 5 . 10 15 

Cys lie lie Thr Ser Leu Thr Gly Arg Asp Arg Asn Gin Val Glu Gly 
20 25 30 

Glu Val Gin Val Val Ser Thr Ala Thr Gin Ser Phe Leu Ala Thr Cys 
35 40 45 

Val Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Ser Lys Thr 
50 55 60 

Leu Ala Gly Pro Lys Gly Pro lie Thr Gin Met Tyr Thr Asn Val Asp 
65 70 75 80 

Gin Asp Leu Val Gly Trp Gin Ala Pro Pro Gly Ala Arg Ser Leu Thr 
85 90 95 

Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala 
100 105 110 

Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu 
115 120 125 

Ser Pro Arg Pro Val Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu 
130 135 140 

Leu Cys Pro Ser Gly His Ala Val Gly lie Phe Arg Ala Ala Val Cys 
145 150 155 160 

Thr Arg Gly Val Ala Lys Ala Val Asp Phe Val Pro Val Glu Ser Met 

165 170 175 

Glu Thr Thr Met Arg Ser Pro Val Phe Thr Asp Asn Ser Ser Pro Pro 
180 185 190 

Ala Val Pro Gin Thr Phe Gin Val Ala His Leu His Ala Pro Thr Gly 

195 200 205 

Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala Ala Gin Gly Tyr 
210 215 220 

Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala Thr Leu Gly Phe Gly 
225 230 235 240 

Ala Tyr Met Ser Lys Ala His Gly lie Asp Pro Asn lie Arg Thr Gly 
245 250 255 

Val Arg Thr He Thr Thr Gly Ala Pro lie Thr Tyr Ser Thr Tyr Gly 
260 265 270 

Lys Phe Leu Ala Asp Gly Gly Cys Ser Gly Gly Ala Tyr Asp He He 
275 280 285 

He Cys Asp Glu Cys His Ser Thr Asp Ser Thr Thr He Leu Gly He 
290 295 300 
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Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly Ala Arg Leu Val Val 

305 310 315 320 

Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Val Pro His Pro Asn 
325 330 335 

lie Glu Glu Val Ala Leu Ser Ser Thr Gly Glu lie Pro Phe Tyr Gly 
340 345 350 

Lys Ala lie Pro lie Glu Thr lie Lys Gly Gly Arg His Leu lie Phe 
355 360 365 

Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Ser Gly 
370 375 380 

Leu Gly Leu Asn Ala Val Ala Tyr Tyr Arg . Gly Leu Asp Val Ser Val 
385 390 395 400 

lie Pro Thr Ser Gly Asp Val lie Val Val Ala Thr Asp Ala Leu Met 
405 410 415 

Thr Gly Phe Thr Gly Asp Phe Asp Ser Val lie Asp Cys Asn Thr Cys 
420 425 430 

Val Thr Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe Thr lie Glu 
435 440 445 

Thr Thr Thr Val Pro Gin Asp Ala Val Ser Arg Ser Gin Arg Arg Gly 
450 455 460 

Arg Thr Gly Arg iSly Arg Met Gly lie Tyr Arg Phe Val Thr Pro Gly 
465 470 475 480 

Glu Arg Pro Ser Gly Met Phe Asp Ser Ser Val Leu Cys Glu Cys Tyr 
485 490 495 

Asp Ala Gly Cys Ala Trp Tyr Glu Leu Thr Pro Ala Glu Thr Ser Val 
500 505 510 

Arg Leu Arg Ala Tyr Leu Asn Thr Pro Gly Leu Pro Val Cys Gin Asp 
515 520 525 

His Leu Glu Phe Trp Glu Ser Val ^e Thr Gly Leu Thr His He Asp 
530 535 . 540 

Ala His Phe Leu Ser Gin Thr Lys Gin Ala Gly Asp Asn Phe Pro Tyr 
545 550 555 560 

Leu Val Ala Tyr Gin Ala Thr Val Cys Ala Arg Ala Gin Ala Pro Pro 
565 570 575 

Pro Ser Trp Asp Gin Met Trp Glu Cys Leu lie Arg Leu Lys Pro Thr 
580 585 590 

Leu His Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ala Val Gin Asn 
595 600 605 

Glu Val Thr Thr Thr His Pro He Thr Lys Tyr He Met Ala Cys Met 
610 615 620 

Ser Ala Asp Leu Glu Val Val Thr Ser Thr Trp Val Leu Val Gly Gly 
625 630 635 640 



Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Thr Thr Gly Ser Val Val 
645 650 655 
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lie Val Gly Arg He He Leu Ser Gly Lys Fro Ala He He Pro Asp 
660 665 670 

Arg Glu Val Leu Tyr Arg Glii Phe Asp Glu Het Glu Glu Cys Ala Ser 
675 680 685 

His Leu Pro Tyr He Glu Gin Gly Met Gin Leu Ala Glu Gin Phe Lys 
690 695 700 

Gin Lys Ala He Gly Leu Leu Gin Thr Ala Thr Lys Gin Ala Glu Ala 
705 710 715 720 

Ala Ala Pro Val Val Glu Ser Lys Trp Arg Thr Leu Glu Ala Phe Trp 
725 730 735 

Ala Lys His Met Trp Asn Phe He Ser Gly He Gin Tyr Leu Ala Gly 
740 745 750 

Leu Ser Thr Leu Pro Gly Asn Pro Ala He Ala Ser Leu Het Ala Phe 
755 760 765 

Thr Ala Ser He Thr Ser Pro Leu Thr Thr Gin His Thr Leu Leu Phe 
770 775 780 

Asn He Leu Gly Gly Trp Val Ala Ala Gin Leu Ala Pro Pro Ser Ala 
785 790 795 800 

Ala Ser Ala Phe Val Gly Ala Gly He Ala Gly Ala Ala Val Gly Ser 
805 810 815 

He Gly Leu Gly Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala 
820 825 830 

Gly Val 7aa Gly Ala Leu Val Ala Phe Lys Val Met Ser Gly Glu Met 
835 840 845 

Pro Ser Thr Glu Asp Leu Val Asn Leu Leu Pro Ala He Leu Ser Pro 
850 855 860 

Gly Ala Leu Val Val Gly Val Val Cys Ala Ala He Leu Arg Arg His 
865 870 875 880 

Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met Asn Arg Leu He Ala 
885 . 890 895 

Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val Pro Glu 
900 905 910 

Ser Asp Ala Ala Ala Arg Val Thr Gin He Leu Ser Gly Leu Thr He 
915 920 925 

Thr Gin Leu Leu Lys Arg Leu His Gin Trp He Asn Glu Asp Cys Ser 
930 935 940 

Thr Pro Cys Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys 
945 950 955 960 

Thr Val Leu Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro 
965 970 975 

Arg Leu Pro Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly 
980 985 990 

Val Trp Arg Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala 
995 1000 1005 
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Gin lie Thr Gly His Val Lys Asn Gly Ser Met Arg lie Val Gly Pro 
1010 1015 1020 

Arg Thr Cys Ser Asn Thr Trp His Gly Thr Phe Pro lie Asn Ala Tyr 
1025 1030 1035 1040 

Thr Thr Gly Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala 
1045 1050 1055 

Leu Txrp Arg Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly 
1060 1065 1070 

Asp Phe His Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro 
1075 1080 1085 

Cys Gin Val Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg 
1090 1095 1100 

Leu His Arg Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val 
1105 1110 1115 1120 

Thr Phe Leu Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro 
1125 1130 1135 

Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp 
1140 1145 1150 

Pro Ser His lie Thr Ala Glu Thr Ala Lys Arg Gly Leu Ala Arg Gly 
1155 1160 1165 

Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro 
1170 1175 1180 

Ser Leu Lys Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp 
1185 1190 1195 1200 

Leu He Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn He 
1205 1210 1215 

Thr Arg Val Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe Glu 
1220 1225 1230 

Pro Leu Gin Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu 
1235 . 1240 1245 

He Leu Arg Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala 
1250 1255 1260 

Arg Pro Asp Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp 
1265 1270 1275 1280 

Tyr Val Pro Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala 
1285 1290 1295 

Pro Pro He Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu 
1300 1305 1310 

Ser Thr Val Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly 
1315 1320 1325 

Ser Ser Glu Ser ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro 
1330 1335 1340 

Asp Gin Pro Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr 
1345 1350 1355 1360 
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Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser 
1365 1370 1375 

Asp Gly Ser Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val 
1380 1385 1390 

Cys Cys Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu lie Thr Pro Cys 
1395 1400 1405 

Ala Ala Glu Glu Thr Lys Leu Pro lie Asn Ala Leu Ser Asn Ser Leu 
1410 1415 1420 

Leu Arg His His Asn Leu Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser 
1425 1430 1435 1440 

Leu Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp Asp 
1445 1450 1455 

His Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser' Thr Val 
1460 1465 1470 

Lys Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro 
1475 1480 1485 

His Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn 
1490 1495 1500 

Leu Ser Ser Lys Ala Val Asn His lie Arg Ser Val Trp Lys Asp Leu 
1505 1510 1515 1520 

Leu Glu Asp Thr Glu Thr Pro lie Asp Thr Thr lie Met Ala Lys Asn 
1525 1530 1535 

Glu Val Phe Cys Val Gin P3:o Glu Lys Gly Gly Arg Lys Pro Ala Arg 
1540 1545 1550 

Leu lie Val Phe Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala 
1555 1560 1565 

Leu Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser 
1570 1575 1580 

Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn 
. 1585 1590 1595 . . 1600 

Ala Trp Lys Ala Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr Arg 
1605 1610 1615 

Cys Phe Asp Ser Thr Val Thr Glu Asn Asp lie Arg Val Glu Glu Ser 
1620 1625 1630 

lie Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg 
1635 1640 1645 

Ser Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys 
1650 1655 1660 

Gly Gin Asn Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr 
1665 1670 1675 1680 

Thr Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ala Ala Ala 
1685 . 1690 1695 

Cys Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp 
1700 1705 1710 
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Asp Leu Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Glu Ala 
1715 1720 1725 

Ser lieu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro 
1730 1735 1740 

Gly Asp Pro Pro Lys Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys 
1745 1750 1755 1760 

Ser Ser Asn Val Ser Val Ala His Asp Ala ser Gly Lys Arg Val Tyr 
1765 1770 1775 

Tyr Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu 
1780 1785 1790 

Thr Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn He He Met 
1795 1800 1805 

Tyr Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe 
1810 1815 1820 

Ser He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Gin 
1825 1830 1835 1840 

He Tyr Gly Ala Cys Tyr Ser He Glu Pro Leu Asp Leu Pro Gin He 
1845 1850 1855 

He Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser 
1860 1865 1870 

Pro Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val 
1875 1880 1885 

Pro Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Arg 
1890 1895 1900 

Leu Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe 
1905 1910 1915 1920 

Asn Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro He Pro Ala Ala 
1925 1930 1935 

Ser Gin Leu Asp Leu Ser Ser Trp Phe Val Ala Gly Tyr Ser Gly Gly 
1940 1945 . 1950 

Asp He Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met Trp 
1955 1960 1965 

Cys lieu Leu Leu Leu Ser Val Gly Val Gly He Tyr -Leu Leu Pro Asn 
1970 1975 1980 

Arg 
1985 



<210> 18 
<211> 447 
<212> PRT 

<213> Hepatitis C virus 
<400> 18 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp He Cys Thr Val Leu 
15 10 15 
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Thr Asp Pbe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 
20 25 30 

Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 
35 40 45 

Gly Asp Gly lie Met Gin Thr Thr Cys Pro Cys Gly Ala Gin lie Thr 
50 55 60 

Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Arg Thr Cys 
65 70 75 80 

Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr Thr Thr Gly 
85 90 95 

Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 105 110 

Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 

val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu' Pro Cys Glu Pro 
180 185 190 

Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His 
195 200 205 

He Thr Ala Glu Thr Ma Lys Arg Gly Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Ser Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys 
225 230 235 240 

Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu He Glu 

245 250 255 

Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn He Thr Arg Val 
260 265 270 

Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Pbe Glu Pro Leu Gin 
275 280 285 

Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu He Leu Arg 
290 295 300 

Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala Arg Pro Asp 
305 310 315 320 

Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp Tyr Val Pro 
325 330 335 

Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro He 
340 345 350 

Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr Val 
355 360 365 
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Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser Glu 
370 375 380 

Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin Pro 
385 390 395 400 

Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met 
405 410 415 

Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser 
420 425 430 

Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
435 440 445 



<210> 19 
<211> 447 
<212> PRT 

<213> Hepatitis C virus 
<400> 19 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Trp lie Cys Thr Val Leu 
15 10 15 

* Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 
20 25 30 

Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 
35 40 45 

Gly Asp Gly lie Met Gin Thr Thr Cys Pro Cys Gly Ala Gin lie Thr 
50 55 60 

Gly His Val Lys Asn Gly Ser Met Arg lie Val Gly Pro Arg Thr Cys 
65 70 75 80 

Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr Thr Thr Gly 
85 90 95 

Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 ' 105 110 

Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115. 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu. Arg Glu Glu Val Thr Phe Leu 
165 170 175 

Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro 
180 185 190 

Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His 
195 200 205 

He Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro 

210 215 220 

Ser Leu Ser Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys 
225 230 235 240 
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Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu lie Glu 
245 250 255 

Ala Asn X«eu Leu Trp Arg Gin Glu Met Gly Gly Asn lie Thr Arg val 
260 265 270 

Glu Ser Glu Asn Lys Val Val lie Leu Asp Ser Phe Glu Pro Leu Gin 
273 280 285 

Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu He Leu Arg 
290 295 300 

Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala Arg Pro Asp 

305 310 315 320 

Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp Tyr Val Pro 
325 330 335 

Pro Val Val His Gly Cys .Pro Leu Pro Pro Ala Lys Ala Pro Pro He 
340 345 350 

Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr Val 
355 360 365 

Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser Glu 
370 375 380 

Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin Pro 

385 390 395 400 

Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met 
405 410 415 

Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser 
420 425 430 

Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
435 440 445 



<210> 20 
<211> 447 
<212> PRT 

<213> Hepatitis C virus 
<400> 20 

Ser Gly Ser Trp Leu Arg Asp Val Trp Asp Irp He Cys Thr Val Leu 
1 5 10 15 

Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 
20 25 30 

Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 
35 40 45 

Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala Gin He Thr 
5b 55 60 

Gly His Val Lys Asn Gly Ser Met Arg He Val Gly Pro Arg Thr Cys 
65 70 75 80 

Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr Thr Thr Gly 
85 90 95 
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Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 105 110 

Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val 2Vrg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 

Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro 
180 185 190 

Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His 
195 200 205 

lie Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Cys Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys 
225 230 "235 240 

Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu He Glu 
245 250 255 

Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn He Thr Arg Val 

260 . 265 270 

Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe Glu Pro Leu Gin 
275 280 285 

Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu He Leu Arg 
290 295 300 

Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala Arg Pro Asp 
305 310 315 320 

Tyr Asn Pro Pro Leu Leu Glu Ser Trp Lys Asp Pro Asp Tyr Val Pro 
325 330 335 

Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro He 
340 345 350 

Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr Val 
355 360 365 

Ser Ser Ala Leu Ala Glu Leu laa Thr Lys Thr Phe Gly Ser Ser Glu 
370 375 380 

Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin Pro 
385 390 395 400 

Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met 
405 410 415 

Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser 
420 425 430 



Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
435 440 445 
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<210> .21 
<211> 447 
<212> PRT 

<213> Hepatitis C virus 
<400> 21 

Ser Gly Ser Tacp Leu Arg Asp Val Trp Asp Trp He Cys Thr Val Leu 
15 10 15 

Thr Asp Phe Lys Thr Trp Leu Gin Ser Lys Leu Leu Pro Arg Leu Pro 
20 25 30 

Gly Val Pro Phe Phe Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg 
35 40 45 

Gly Asp Gly He Met Gin Thr Thr Cys Pro Cys Gly Ala Gin He Thr 
50 55 60 

Gly His Val Lys Asn Gly Ser Met Arg lie Val Gly Pro Arg Thr Cys 
65 70 75 80 

Ser Asn Thr Trp His Gly Thr Phe Pro He Asn Ala Tyr Thr Thr Gly 
85 90 95 

Pro Cys Thr Pro Ser Pro Ala Pro Asn Tyr Ser Arg Ala Leu Trp Arg 
100 105 110 

Val Ala Ala Glu Glu Tyr Val Glu Val Thr Arg Val Gly Asp Phe His 
115 120 125 

Tyr Val Thr Gly Met Thr Thr Asp Asn Val Lys Cys Pro Cys Gin Val 
130 135 140 

Pro Ala Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu His Arg 
145 150 155 160 

Tyr Ala Pro Ala Cys Lys Pro Leu Leu Arg Glu Glu Val Thr Phe Leu 
165 170 175 

Val Gly Leu Asn Gin Tyr Leu Val Gly Ser Gin Leu Pro Cys Glu Pro 
180 185 190 

Glu Pro Asp Val Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His 
195 200 205 

He Thr Ala Glu Thr Ala Lys Arg Arg Leu Ala Arg Gly Ser Pro Pro 
210 215 220 

Pro. Leu Ala Ser Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys 
225 230 235 240 

Ala Thr Cys Thr Thr Arg His Asp Ser Pro Asp Ala Asp Leu He Glu 
245 250 255 

Ala Asn Leu Leu Trp Arg Gin Glu Met Gly Gly Asn He Thr Arg Val 
260 265 270 

Glu Ser Glu Asn Lys Val Val He Leu Asp Ser Phe Glu Pro Leu Gin 
275 280 285 

Ala Glu Glu Asp Glu Arg Glu Val Ser Val Pro Ala Glu He Leu Arg 
290 295 300 

Arg Ser Arg Lys Phe Pro Arg Ala Met Pro He Trp Ala Arg Pro Asp 
305 310 315 320 
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Tyr Asn Pro Pro Leu Leu 6lu Ser Trp Lys Asp Pro Asp Tyr Val Pro 
325 330 335 

Pro Val Val His Gly Cys Pro Leu Pro Pro Ala Lys Ala Pro Pro He 
340 345 350 

Pro Pro Pro Arg Arg Lys Arg Thr Val Val Leu Ser Glu Ser Thr Val 

355 360 365 

Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser Glu 
370 375 380 

Ser Ser Ala Val Asp Ser Gly Thr Ala Thr Ala Ser Pro Asp Gin Pro 
385 390 395 400 

Ser Asp Asp Gly Asp Ala Gly Ser Asp Val Glu Ser Tyr Ser Ser Met 
405 410 415 

Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser 
420 425 430 

Trp Ser Thr Val Ser Glu Glu Ala Ser Glu Asp Val Val Cys Cys 
435 440 445 



<210> 22 
<211> 7789 
<212> DHA 

<213> Hepatitis C virus 
<400> 22 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgbag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac cagaccacaa cggtttccct 360 
ctagcgggat caattccgcc cctctccctc ccccccccct aacgttactg gccgaagccg 420 
cttggaataa ggccggtgtg cgtttgtcta tatgttattt tccaccatat tgccgtcttt 480 
tggcaatgtg agggcccgga aacctggccc tgtcttcttg acgagcattc ctaggggtct 540 
ttcccctctc gccaaaggaa tgcaaggtct gttgaatgtc gtgaaggaag cagttcctct 600 
ggaagcttct tgaagacaaa caacgtctgt agcgaccctt tgcaggcagc ggaacccccc 660 
acctggcgac aggtgcctct gcggccaaaa gccacgtgta taagatacac ctgcaaaggc 720 
ggcacaaccc^ cagtgccacg ttgtgagttg gatagttgtg gaaagagtca aatggctctc. 780 
ctcaagcgta ttcaacaagg ggctgaagga tgcccagaag gtaccccatt gtatgggatc B40 
tgatctgggg cctcggtgca catgctttac atgtgtttag tcgaggttaa aaaacgtcta 900 
ggccccccga accacgggga cgtggttttc ctttgaaaaa cacgataata ccatggaccg 960 
ggagatggca gcatcgtgcg gaggcgcggt tttcgtaggt ctgatactct tgaccttgtc 1020 
accgcactat aagctgttcc tcgctaggct cratatggtgg ttacaatatt ttatcaccag 1080 
ggccgaggca cacttgcaag tgtggatccc ccccctcaac gttcgggggg gccgcgatgc 1140 
cgtcatcctc ctcacgtgcg cgatccaccc agagctaatc tttaccatca ccaaaatctt 1200 
gctcgccata ctcggtccac tcatggtgct ccaggctggt ataaccaaag tgccgtactt 1260 
cgtgcgcgca cacgggctca ttcgtgcatg catgctggtg cggaaggttg ctgggggtca 1320 
ttatgtccaa atggctctca tgaagttggc cgcactgaca ggtacgtacg tttatgacca 1380 
tctcacccca ctgcgggact gggcccacgc gggcctacga gaccttgcgg tggcagttga 1440 
gcccgtcgtc ttctctgata tggagaccaa ggttatcacc tggggggcag acaccgcggc 1500 
gtgtggggac atcatcttgg gcctgcccgt ctccgcccgc agggggaggg agatacatct 1560 
gggaccggca gacagccttg aagggcaggg gtggcgactc ctcgcgccta ttacggccta 1620 
ctcccaacag acgcgaggcc tacttggctg catcatcact agcctcacag gccgggacag 1680 
gaaccaggtc gagggggagg tccaagtggt ctccaccgca acacaatctt tcctggcgac 1740 
ctgcgtcaat ggcgtgtgtt ggactgtcta tcatggtgcc ggctcaaaga cccttgccgg 1800 
cccaaagggc ccaatcaccc aaatgtacac caatgtggac caggacctcg tcggctggca 1860 
agcgcccccc ggggcgcgtt ccttgacacc atgcacctgc ggcagctcgg acctttactt 1920 
ggtcacgagg catgccgatg tcattccggt gcgccggcgg ggcgacagca gggggagcct 1980 
actctccccc aggcccgtct cctacttgaa gggctcttcg ggcggtccac tgctctgccc 2040 
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ctcggggcac gctgtgggca tctttcgggc tgccgtgtgc acccgagggg ttgcgaaggc 2100 
ggtggacttt gtacccgtcg agtctatgga aaccactatg cggtccccgg tcttcacgga 2160 
caactcgtcc cctccggccg taccgcagac attccaggtg gcccatctac acgcccctac 2220 
tggtagcggc aagagcacta aggtgccggc tgcgtatgca gcccaagggt ataaggtgct 2280 
tgtcctgaac ccgtccgtcg ccgccaccct aggtttcggg gcgtatatgt ctaaggcaca 2340 
tggtatcgac cctaacatca gaaccggggt aaggaccatc accacgggtg cccccatcac 2400 
gtactccacc tatggcaagt ttcttgccga cggtggttgc tctgggggcg cctatgacat 2460 
cataatatgt gatgagtgcc actcaactga ctcgaccact atcctgggca tcggcacagt 2520 
cctggaccaa gcggagacgg ctggagcgcg actcgtcgtg ctcgccaccg ctacgcctcc 2580 
gggatcggtc accgtgccac atccaaacat cgaggaggtg gctctgtcca gcactggaga 2640 
aatccccttt tatggcaaag ccatccccat cgagaccatc aaggggggga ggcacctcat 2700 
tttctgccat tccaagaaga aatgtgatga gctcgccgcg aagctgtccg gcctcggact 2760 
caatgctgta gcatattacc ggggccttga tgtatccgtc ataccaacta gcggagacgt 2820 
cattgtcgta gcaacggacg ctctaatgac gggctttacc ggcgatttcg actcagtgat 2880 
cgactgcaat acatgtgtca cccagacagt cgacttcagc ctggacccga cctt caeca t 2940 
tgagacgacg accgtgccac aagacgcggt gtcacgctcg cagcggcgag gcaggactgg 3000 
taggggcagg atgggcattt acaggtttgt gactccagga gaacggccjct cgggcatgtt 3060 
cgattcctcg gttctgtgcg agtgctatga cgcgggctgt gcttggtacg agctcacgcc 3120 
cgccgagacc tcagttaggt tgcgggctta cctaaacaca ccagggttgc ccgtctgcca 3180 
ggaccatctg gagttctggg agagcgtctt tacaggcctc acccacatag acgcccattt 3240 
cttgtcccag actaagcagg caggagacaa cttcccctac ctggtagcat accaggctac 3300 
ggtgtgcgcc agggctcagg ctccacctcc atcgtgggac caaatgtgga agtgtctcat 3360 
acggctaaag cctacgctgc acgggccaac gcccctgctg tataggctgg gagccgttca 3420 
aaacgaggtt actaccacac accccataac caaatacatc atggcatgca tgtcggctga 3480 
cctggaggtc gtcacgagca cctgggtgct ggtaggcgga gtcctagcag ctctggccgc 3540 
gtattgcctg acaacaggca gcgtggtcat tgtgggcagg atcatcttgt ccggaaagcc 3600 
ggccatcatt cccgacaggg aagtccttta ccgggagttc gatgagatgg aagagtgcgc 3660 
ctcacacctc ccttacatcg aacagggaat gcagctcgcc gaacaattca aacagaaggc 3720 
aatcgggttg ctgcaaacag ccaccaagca agcggaggct gctgctcccg tggtggaatc 3780 
caagtggcgg accctcgaag ccttctgggc gaagcatatg tggaatttca tcagcgggat 3840 
acaatattta gcaggcttgt ccactctgcc tggcaacccc gcgatagcat cactgatggc 3900 
attcacagcc tctatcacca gcccgctcac cacccaacat accctcctgt ttaacatcct 3960 
ggggggatgg gtggccgccc aacttgctcc tcccagcgct gcttctgctt tcgtaggcgc 4020 
cggcatcgct ggagcggctg ttggcagcat aggccttggg aaggtgcttg tggatatttt 4080 
ggcaggttat ggagcagggg tggcaggcgc gctcgtggcc tttaaggtca tgagcggcga 4140 
gatgccctcc accgaggacc tggttaacct actccctgct atcctctccc ctggcgccct 4200 
agtcgtcggg gtcgtgtgcg cagcgatact gcgtcggcac gtgggcccag gggagggggc 4260 
tgtgcagtgg atgaaccggc tgatagcgtt cgcttcgcgg ggtaaccacg tctcccccac 4320 
gcactatgtg cctgagagcg acgctgcagc acgtgtcact cagatcctct ctagtcttac 4380 
catcactcag ctgctgaaga ggcttcacca gtggatcaac gaggactgct ccacgccatg 4440 
ctccggctcg tggctaagag atgtttggga ttggatatgc acggtgttga ctgatttcaa 4500 
gacctggctc cagtccaagc tcctgccgcg attgccggga gtccccttct tctcatgtca 4560 
acgtgggtac aagggagtct ggcggggcga cggcatcatg caaaccacct gcccatgtgg 4620 
agcacagatc accggacatg tgaauacgg -ttccatgagg atcgtggggc ctaggacctg 4680 
tagtaacacg tggcatggaa cattccccat taacgcgtac accacgggcc cctgcacgcc 4740 
ctccccggcg ccaaattatt ctagggcgct gtggcgggtg gctgctgagg agtacgtgga 4800 
ggttacgcgg gtgggggatt tccactacgt gacgggcatg accactgaca acgtaaagtg 4860 
cccgtgtcag gttccggccc ccgaattctt cacagaagtg gatggggtgc ggttgcacag 4920 
gtacgctcca gcgtgcaaac ccctcctacg ggaggaggtc acattcctgg tcgggctcaa 4980 
tcaatacctg gttgggtcac agctcccatg cgagcccgaa ccggacgtag cagtgctcac 5040 
ttccatgctc accgacccct cccacattac ggcggagacg gctaagcgta ggctggccag 5100 
gggatctccc ccctccttgg ccagctcatc agctatccag ctgtctgcgc cttccttgaa 5160 
ggcaacatgc actacccgtc atgactcccc ggacgctgac ctcatcgagg ccaacctcct 5220 
gtggcggcag gagatgggcg ggaacatcac ccgcgtggag tcagaaaata aggtagtaat 5280 
tttggactct ttcgagccgc tccaagcgga ggaggatgag agggaagtat ccgttccggc 5340 
ggagatcctg cggaggtcca ggaaattccc tcgagcgatg cccatatggg cacgcccgga 5400 
ttacaaccct ccactgttag agtcctggaa ggacccggac tacgtccctc cagtggtaca 5460 
cgggtgtcca ttgccgcctg ccaaggcccc tccgatacca cctccacgga ggaagaggac 5520 
ggttgtcctg tcagaatcta ccgtgtcttc tgccttggcg gagctcgcca caaagacctt 5580 
cggcagctcc gaatcgtcgg ccgtcgacag cggcacggca acggcctctc ctgaccagcc 5640 
ctccgacgac ggcgacgcgg gatccgacgt tgagtcgtac tcctccatgc ccccccttga 5700 
9ggggagccg ggggatcccg atctcagcga cgggtcttgg tctaccgtaa gcgaggaggc 5760 
tagtgaggac gtcgtctgct gctcgatgtc ctacacatgg acaggcgccc tgatcacgcc 5820 
atgcgctgcg gaggaaacca agctgcccat caatgcactg agcaactctt tgctccgtca 5880 
ccacaacttg gtctatgcta caacatctcg cagcgcaagc ctgeggcaga agaaggtcac 5940 
ctttgacaga ctgcaggtcc tggacgacca ctaccgggac gtgctcaagg agatgaaggc 6000 
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gaaggcgtcc acagttaagg ctaaacttct atccgtggag gaagcctgta agctgacgcc 6060 
cccacattcg gccagatcta aatttggcta tggggcaaag gacgtccgga acctatccag 6120 
caaggccgtt aaccacatcc gctccgtgtg gaaggacttg ctggaagaca ctgagacacc 6180 
aattgacacc accatcatgg caaaaaatga ggttttctgc gtccaaccag agaagggggg 6240 
ccgcaagcca gctcgcctta tcgtattccc agatttgggg gttcgtgtgt gcgagaaaat 6300 
ggccctttac gatgtggtct ccaccctccc tcaggccgtg atgggctctt catacggatt 6360 
ccaatactct cctggacagc gggtcgagtt cctggtgaat gcctggaaag cgaagaaatg 6420 
ccctatgggc ttcgcatatg acacccgctg ttttgactca acggtcactg agaatgacat 6480 
ccgtgttgag gagtcaatct accaatgttg tgacttggcc cccgaagcca gacaggccat 6540 
aaggtcgctc acagagcggc tttacatcgg gggccccctg actaattcta aagggcagaa 6600 
ctgcggctat cgccggtgcc gcgcgagcgg tgtactgacg accagctgcg gtaataccct 6660 
cacatgttac ttgaaggccg ctgcggcctg tcgagctgcg aagctccagg actgcacgat 6720 
gctcgtatgc ggagacgacc ttgtcgttat ctgtgaaagc gcggggaccc aagaggacga 6780 
ggcgagccta cgggccttca cggaggctat gactagatac tctgcccccc ctggggaccc 6840 
gcccaaacca gaatacgact tggagttgat aacatcatgc tcctccaatg tgtcagtcgc 6900 
gcacgatgca tctggcaaaa gggtgtacta tctcacccgt gaccccacca ccccccttgc 6960 
gcgggctgcg tgggagacag ctagacacac tccagtcaat tcctggctag gcaacatcat 7020 
catgtatgcg cccaccttgt gggcaaggat gatcctgatg actcatttct tctccatcct 7080 
tctagctcag gaacaacttg aaaaagccct agattgtcag atctacgggg cctgttactc 7140 
cattgagcca cttgacctac ctcagatcat tcaacgactc catggcctta gcgcattttc 7200 
actccatagt tactctccag gtgagatcata tagggtggct tcatgcctca ggaaacttgg 7260 
ggtaccgccc ttgcgagtct ggcigacatcg- ggccagaagt gtccgcgcta ggctactgtc 7320 
ccaggggggg agggctgcca cttgtggcaa gtacctcttc aactgggcag taaggaccaa 7380 
gctcaaactc actccaatcc cggctgcgtc ccagttggat ttatccagct ggttcgttgc 7440 
tggttacagc gggggagaca tatatcacag cctgtctcgt gcccgacccc gctggttcat 7500 
gtggtgccta ctcctacttt ctgtaggggt aggcatctat ctactcccca accgatgaac 7560 
ggggacctaa acactccagg ccaataggcc atcctgtttt tttccctttt tttttttctt 7620 
tttttttttt tttttttttt tttttttttt ttctcctttt tttttcctct ttttttcctt 7680 
ttctttcctt tggtggctcc atcttagccc tagtcacggc tagctgtgaa aggtccgtga 7740 
gccgcttgac tgcagagagt gctgatactg gcctctctgc agatcaagt 7789 

<210> 23 
<211> 11062 
<212> DNA 

<213> Hepatitis C virus 
<400> 23 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaaggg cgcgccatga ttgaacaaga tggattgcac gcaggttctc 420 
cggccgcttg ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct 480 
ctgatgccgc cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg 540 
acctgtccgg tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca 600 
cgacgggcgt tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc 660 
tgctattggg cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga 720 
aagtatccat catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc 780 
cattcgacca ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc 840 
ttgtcgatca ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg 900 
ccaggctcaa ggcgcgcatg occgacggcg aggatctcgt cgtgacccat ggcgatgcct 960 
gcttgccgaa tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc 1020 
tgggtgtggc ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc 1080 
ttggcggcga atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc 1140 
agcgcatcgc cttctatcgc cttcttgacg agttcttctg agtttaaaca gaccacaacg 1200 
gtttccctct agcgggatca attccgcccc tctccctccc ccccccctaa cgttactggc 1260 
cgaagccgct tggaataagg ccggtgtgcg tttgtctata tgttattttc caccatattg 1320 
ccgtcttttg gcaatgtgag ggcccggaaa cctggccctg tcttcttgac gagcattcct 1380 
aggggtcttt cccctctcgc caaaggaatg caaggtctgt tgaatgtcgt gaaggaagca 1440 
gttcctctgg aagcttcttg aagacaaaca acgtctgtag cgaccctttg caggcagcgg 1500 
aaccccccac ctggcgacag gtgcctctgc ggccaaaagc cacgtgtata agatacacct 1560 
gcaaaggcgg cacaacccca gtgccacgtt gtgagttgga tagttgtgga aagagtcaaa 1620 
tggctctcct caagcgtatt caacaagggg ctgaaggatg cccagaaggt accccattgt 1680 
atgggatctg atctggggcc tcggtgcaca tgctttacat gtgtttagtc gaggttaaaa 1740 
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aacgtctagg ccccccgaac cacggggacg tggttttcct ttgaaaaaca cgataataat 1800 
gagcacgaat cctaaacctc aaagaaaaac caaacgtaac accaaccgcc gcccacagga 1860 
cgtcaagttc ccgggcggtg gtcagatcgt cggtggagtt tacctgttgc cgcgcagggg 1920 
ccccaggttg ggtgtgcgcg cgactaggaa gacttccgag cggtcgcaac ctcgtggaag 1980 
gcgacaacct atccccaagg ctcgccagcc cgagggtagg gcctgggctc agcccgggta 2040 
cccctggccc ctctatggca atgagggctt ggggtgggca ggatggctcc tgtcaccccg 2100 
tggctctcgg cctagttggg gccccacgga cccccggcgt aggtcgcgca atttgggtaa 2160 
ggtcatcgat accctcacgt gcggcttcgc cgatctcatg gggtacattc cgctcgtcgg 2220 
cgccccccta gggggcgctg ccagggccct ggcgcatggc gtccgggttc tggaggacgg 2280 
cgtgaactat gcaacaggga atctgcccgg ttgctccttt tctatcttcc ttttggcttt 2340 
gctgtcctgt ttgaccatcc cagcttccgc ttatgaagtg cgcaacgtat ccggagtgta 2400 
ccatgtcacg aacgactgct ccaacgcaag cattgtgtat gaggcagcgg acatgatcat 2460 
gcataccccc gggtgcgtgc cctgcgttcg ggagaacaac tcctcccgct gctgggtagc 2520 
gctcactccc acgctcgcgg ccaggaacgc tagcgtcccc actacgacga tacgacgcca 2580 
tgtcgatttg ctcgttgggg cggctgctct ctgctccgct atgtacgtgg gagatctctg 2640 
cggatctgtt ttcctcgtcg cccagctgtt caccttctcg cctcgccggc acgagacagt 2700 
acaggactgc aattgctcaa tatatcccgg ccacgtgaca ggtcaccgta tggcttggga 2760 
tatgatgatg aactggtcac ctacagcagc cctagtggta tcgcagttac tccggatccc 2820 
acaagctgtc gtggatatgg tggcgggggc ccattgggga gtcctagcgg gccttgccta 2880 
ctattccatg gtggggaact gggctaaggt tctgattgtg atgctactct ttgccggcgt 2940 
tgacggggga acctatgtga caggggggac gatggccaaa aacaccctcg ggattacgtc 3000 
cctcttttca cccgggtcat cccagaaaat ccagcttgta aacaccaacg gcagctggca 3060 
catcaacagg actgccctga actgcaatga ctccctcaac actgggttcc ttgctgcgct 3120 
gttctacgtg cacaagttca actcatctgg atgcccagag cgcatggcca gctgcagccc 3180 
catcgacgcg ttcgctcagg ggtgggggcc catcacttac aatgagtcac acagctcgga 3240 
ccagaggcct tattgttggc actacgcacc ccggccgtgc ggtatcgtac ccgcggcgca 3300 
ggtgtgtggt ccagtgtact gcttcacccc aagccctgtc gtggtgggga cgaccgaccg 3360 
gttcggcgtc cctacgtaca gttgggggga gaatgagacg gacgtgctgc ttcttaacaa 3420 
cacgcggccg ccgcaaggca actggtttgg ctgtacatgg atgaatagca ctgggttcac 3480 
caagacgtgc gggggccccc cgtgtaacat cggggggatc ggcaa1:aaaa ccttgacctg 3540 
ccccacggac tgcttccgga agcaccccga ggccacttac accaagtgtg gttcggggcc 3600 
ttggttgaca cccagatgct tggtccacta cccatacagg ctttggcact acccctgcac 3660 
tgtcaacttt accatcttca aggttaggat gtacgtgggg ggagtggagc acaggctcga 3720 
agccgcatgc aattggactc gaggagagcg ttgtaacctg gaggacaggg acagatcaga 3780 
gcttagcccg ctgctgctgt ctacaacgga gtggcaggta ttgccctgtt ccttcaccac 3840 
cctaccggct ctgtccactg gtttgatcca tctccatcag aacgtcgtgg acgtacaata 3900 
cctgtacggt atagggtcgg cggttgtctc ctttgcaatc aaatgggagt atgtcctgtt 3960 
gctcttcctt cttctggcgg acgcgcgcgt ctgtgcctgc ttgtggatga tgctgctgat 4020 
agctcaagct gaggccgccc tagagaacct ggtggtcctc aacgcggcat ccgtggccgg 4080 
ggcgcatggc attct-ctcct tcctcgtgtt cttctgtgct gcctggtaca tcaagggcag 4140 
gctggtccct ggggcggcat atgccctcta cggcgtatgg ccgctactcc tgctcctgct 4200 
ggcgttacca ccacgagcat acgccatgga ccgggagatg gcagcatcgt gcggaggcgc 4260 
ggttttcgta ggtctgatac tcttgacctt gtcaccgcac tataagctgt tcctcgctag 4320 
gctcatatgg tggttacaat attttatcac cagggccgag gcacacttgc aagtgtggat 4380 
cccccccctc aacgttcggg ggggccgcga tgccgtcatc ctcctcacgt gcgcgatcca 4440 
cccagagcta atcliti-tacca tcaccaaaat ctiigcticgcc atactcggtc cactcatggt 4500 
gctccaggct ggtataacca aagtgccgta cttcgtgcgc gcacacgggc tcattcgtgc 4560 
atgcatgctg gtgcggaagg ttgctggggg tcattatgtc caaatggctc tcatgaagtt 4620 
ggccgcactg acaggtacgt acgtttatga ccatctcacc ccactgcggg actgggccca 4680 
cgcgggccta cgagaccttg cggtggcagt tgagcccgtc gtcttctctg atatggagac 4740 
caaggttatc acctgggggg cagacaccgc ggcgtgtggg gacatcatct tgggcctgcc 4800 
cgtctccgcc cgcaggggga gggagataca tctgggaccg gcagacagcc ttgaagggca 4860 
ggggtggcga ctcctcgcgc ctattacggc ctactcccaa cagacgcgag gcctacttgg 4920 
ctgcatcatc actagcctca caggccggga caggaaccag gtcgaggggg aggtccaagt 4980 
ggtctccacc gcaacacaat ctttcctggc gacctgcgtc aatggcgtgt gttggactgt 5040 
ctatcatggt gccggctcaa agacccttgc cggcccaaag ggcccaatca cccaaatgta 5100 
caccaatgtg gaccaggacc tcgtcggctg gcaagcgccc cccggggcgc gttccttgac 5160 
accatgcacc tgcggcagct cggaccttta cttggtcacg aggcatgccg atgtcattcc 5220 
ggtgcgccgg cggggcgaca gcagggggag cctactctcc cccaggcccg tctcctactt 5280 
gaagggctct tcgggcggtc cactgctctg cccctcgggg cacgctgtgg gcatctttcg 5340 
ggctgccgtg tgcacccgag gggttgcgaa ggcggtggac tttgtacccg tcgagtctat 5400 
ggaaaccact atgcggtccc cggtcttcac ggacaactcg tcccctccgg ccgtaccgca 5460 
gacattccag gtggcccatc tacacgcccc tactggtagc ggcaagagca ctaaggtgcc 5520 
ggctgcgtat gcagcccaag ggtataaggt gcttgtcctg aacccgtccg tcgccgccac 5580 
cctaggtttc ggggcgtata tgtctaaggc acatggtatc gaccctaaca tcagaaccgg 5640 
ggtaaggacc atcaccacgg gtgcccccat cacgtactcc acctatggca agtttcttgc 5700 
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cgacggtggt tgctctgggg gcgcctatga . catcataata tgtgatgagt gccactcaac 5760 
tgactcgacc actatcctgg gcatcggcac agtcctggac caagcggaga cggctggagc 5820 
gcgactcgtc gtgctcgcca ccgctacgcc tccgggatcg gtcaccgtgc cacatccaaa 5880 
catcgaggag gtggctctgt ccagcactgg agaaatcccc ttttatggca aagccatccc 5940 
catcgagacc atcaaggggg ggaggcacct cattttctgc cattccaaga agaaatgtga 6000 
tgagctcgcc gcgaagctgt ccggcctcgg actcaatgct gtagcatatt accggggcct 6060 
tgatgtatcc gtcataccaa ctagcggaga cgtcattgtc gtagcaacgg acgctctaat 6120 
gacgggcttt accggcgatt tcgactcagt gatcgactgc aatacatgtg tcacccagac 6180 
agtcgacttc agcctggacc cgaccttcac cattgagacg acgaccgtgc cacaagacgc 6240 
ggtgtcacgc tcgcagcggc gaggcaggac tggtaggggc aggatgggca tttacaggtt 6300 
tgtgactcca ggagaacggc cctcgggcat gttcgattcc tcggttctgt gcgagtgcta 6360 
tgacgcgggc tgtgcttggt acgagctcac gcccgccgag acctcagtta ggttgcgggc 6420 
ttacctaaac acaccagggt tgcccgtctg ccaggaccat ctggagttct gggagagcgt 6480 
ctttacaggc ctcacccaca tagacgccca tttcttgtcc cagactaagc aggcaggaga 6540 
caacttcccc tacctggtag cataccaggc tacggtgtgc gccagggctc aggctccacc 6600 
tccatcgtgg gaccaaatgt ggaagtgtct catacggcta aagccta'cgc tgcacgggcc 6660 
aacgcccctg ctgtataggc tgggagccgt tcaaaacgag gttactacca cacaccccat 6720 
aaccaaatac atcatggcat gcatgtcggc tgacctggag gtcgtcacga gcacctgggt 6780 
gctggtaggc ggagtcctag cagctctggc cgcgtattgc ctgacaacag gcagcgtggt 6840 
cattgtgggc aggatcatct tgtccggaaa gccggccatc attcccgaca gggaagtcct 6900 
ttaccgggag ttcgatgaga tggaagagtg cgcctcacac ctcccttaca tcgaacaggg 6960 
aatgcagctc gccgaacaat tcaaacagaa ggcaatcggg ttgctgcaaa cagccaccaa 7020 
gcaagcggag gctgctgctc ccgtggtgga atccaagtgg cggaccctcg aagccttctg 7080 
ggcgaagcat atgtggaatt tcatcagcgg gatacaatat ttagcaggct tgtccactct 7140 
• gcctggcaac cccgcgatag catcactgat ggcattcaca gcctctatca ccagcccgct 7200 
caccacccaa cataccctcc tgtttaacat cctgggggga tgggtggccg cccaacttgc 7260 
tcctcccagc gctgcttctg ctttcgtagg cgccggcatc gctggagcgg ctgttggcag 7320 
cataggcctt gggaaggtgc ttgtggatat tttggcaggt tatggagcag gggtggcagg 7380 
cgcgctcgtg gcctttaagg tcatgagcgg cgagatgccc tccaccgagg acctggttaa 7440 
cctactccct gctatcctct cccctggcgc cctagtcgtc ggggtcgtgt gcgcagcgat 7500 
actgcgtcgg cacgtgggcc caggggaggg ggctgtgcag tggatgaacc ggctgatagc 7560 
gttcgcttcg cggggtaacc acgtctcccc cacgcactat gtgcctgaga gcgacgctgc 7620 
agcacgtgtc actcagatcc tctctagtct taccatcact cagctgctga agaggcttca 7680 
ccagtggatc aacgaggact gctccacgcc atgctccggc tcgtggctaa gagatgtttg 7740 
ggattggata tgcacggtgt tgactgattt caagacctgg ctccagtcca agctcctgcc 7800 
gcgattgccg ggagtcccct tcttctcatg tcaacgtggg tacaagggag tctggcgggg 7860 
cgacggcatc atgcaaacca cctgcccatg tggagcacag atcaccggac atgtgaaaaa 7920 
cggttccatg aggatcgtgg ggcctaggac ctgtagtaac acgtggcatg gaacattccc 7980 
cattaacgcg tacaccacgg gcccctgcac gccctccccg gcgccaaatt attctagggc 8040 
gctgtggcgg gtggctgctg aggagtacgt ggaggttacg cgggtggggg atttccacta 8100 
cgtgacgggc atgaccactg acaacgtaaa gtgcccgtgt caggttccgg cccccgaatt 8160 
cttcacagaa gtggatgggg tgcggttgca caggtacgct ccagcgtgca aacccctcct 8220 
acgggaggag gtcacattcc tggtcgggct caatcaatac ctggttgggt cacagctccc 8280 
atgcgagccc gaaccggacg tagcagtgct cacttccatg ctcaccgacc cctcccacat 8340 
tacggcggag acggctaagc gtaggctggc caggggatct occccctcct tggccagctc 8400 
atcagctatc cagctgtctg cgccttcctt gaaggcaaca tgcactaccc gtcatgactc 8460 
cccggacgct gacctcatcg aggccaacct cctgtggcgg caggagatgg gcgggaacat 8520 
cacccgcgtg gagtcagaaa ataaggtagt aattttggac tctttcgagc cgctccaagc 8580 
ggaggaggat gagagggaag tatccgttcc ggcggagatc ctgcggaggt ccaggaaatt 8640 
ccctcgagcg atgcccatat gggcacgccc ggattacaac cctccactgt tagagtcctg 8700 
gaaggacccg gactacgrtcc ctccagtggt acacgggtgt ccattgccgc ctgccaaggc 8760 
ccctccgata ccacctccac ggaggaagag gacggttgtc ctgtcagaat ctaccgtgtc 8820 
ttctgccttg gcggagctcg ccacaaagac cttcggcagc tccgaatcgt cggccgtcga 8880 
cagcggcacg gcaacggcct ctcctgacca gccctccgac gacggcgacg cgggatccga 8940 
cgttgagtcg tactcctcca tgccccccct tgagggggag ccgggggatc ccgatctcag 9000 
cgacgggtct tggtctaccg taagcgagga ggctagtgag gacgtcgtct gctgctcgat 9060 
gtcctacaca tggacaggcg ccctgatcac gccatgcgct gcggaggaaa ccaagctgcc 9120 
catcaatgca ctgagcaact ctttgctccg tcaccacaac ttggtctatg ctacaacatc 9180 
tcgcagcgca agcctgcggc agaagaaggt cacctttgac agactgcagg tcctggacga 9240 
ccactaccgg gacgtgctca aggagatgaa ggcgaaggcg tccacagtta aggctaaact 9300 
tctatccgtg gaggaagcct gtaagctgac gcccccacat tcggccagat ctaaatttgg 9360 
ctatggggca aaggacgtcc ggaacctatc cagcaaggcc gttaaccaca tccgctccgt 9420 
gtggaaggac ttgctggaag acactgagac accaattgac accaccatca tggcaaaaaa 9480 
tgaggttttc tgcgtccaac cagagaaggg gggccgcaag ccagctcgcc ttatcgtatt 9540 
cccagatttg ggggttcgtg tgtgcgagaa aatggccctt tacgatgtgg tctccaccct 9600 
ccctcaggcc gtgatgggct cttcatacgg attccaatac tctcctggac agcgggtcga 9660 
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gttcctggtg aatgcctgga aagcgaagaa atgccctatg ggcttcgcat atgacacccg 9720 
ctgttttgac tcaacggtca ctgagaatga catccgtgtt gaggagtcaa tctaccaatg 9780 
ttgtgacttg gcccccgaag ccagacaggc cataaggtcg ctcacagagc .ggctttacat 9840 
cgggggcccc ctgactaatt ctaaagggca gaactgcggc tatcgccggt gccgcgcgag 9900 
cggtgtactg acgaccagct gcggtaatac cctcacatgt tacttgaagg ccgctgcggc 9960 
ctgtcgagct gcgaagctcc aggactgcac gatgctcgta tgcggagacg accttgtcgt 10020 
tatctgtgaa agcgcgggga cccaagagga cgaggcgagc ctacgggcct tcacggaggc 10080 
tatgactaga tactctgccc cccctgggga cccgcccaaa ccagaatacg acttggagtt 10140 
gataacatca tgctcctcca atgtgtcagt cgcgcacgat gcatctggca aaagggtgta 10200 
ctatctcacc cgtgacccca ccacccccct tgcgcgggct gcgtgggaga cagctagaca 10260 
cactccagtc aattcctggc taggcaacat catcatgtat gcgcccacct tgtgggcaag 10320 
gatgatcctg atgactcatt tcttctccat ccttctagct caggaacaac ttgaaaaagc 10380 
cctagattgt cagatctacg gggcctgtta ctccattgag ccacttgacc tacctcagat 10440 
cattcaacga ctccatggcc ttagcgcatt ttcactccat agttactctc caggtgagat 10500 
caatagggtg gcttcatgcc tcaggaaact tggggtaccg cccttgcgag tctggagaca 10560 
tcgggccaga agtgtccgcg ctaggctact gtcccagggg gggagggctg ccacttgtgg 10620 
caagtacctc ttcaactggg cagtaaggac caagctcaaa ctcactccaa tcccggctgc 10680 
gtcccagttg gatttatcca gctggttcgt tgctggttac agcgggggag acatatatca 10740 
cagcctgtct cgtgcccgac cccgctggtt catgtggtgc ctactcctac tttctgtagg 10800 
ggtaggcatc tatctactcc ccaaccgatg aacggggacc taaacactcc aggccaatag 10860 
gccatcctgt ttttttccct tttttttttt cttttttttt tttttttttt tttttttttt 10920 
tttttctcct ttttttttcc tctttttttc cttttctttc ctttggtggc tccatcttag 10980 
ccctagtcac ggctagctgt gaaaggtccg tgagccgctt gactgcagag agtgctgata 11040 
ctggcctctc tgcagatcaa gt 11062 

<210> 24 
<2H> 9605 
<212> DNA 

<213> Hepatitis C virus 
<400> 24 

gccagccccc gattgggggc gacactccac catagatcac tcccctgtga ggaactactg 60 
tcttcacgca gaaagcgtct agccatggcg ttagtatgag tgtcgtgcag cctccaggac 120 
cccccctccc gggagagcca tagtggtctg cggaaccggt gagtacaccg gaattgccag 180 
gacgaccggg tcctttcttg gatcaacccg ctcaatgcct ggagatttgg gcgtgccccc 240 
gcgagactgc tagccgagta gtgttgggtc gcgaaaggcc ttgtggtact gcctgatagg 300 
gtgcttgcga gtgccccggg aggtctcgta gaccgtgcac catgagcacg aatcctaaac 360 
ctcaaagaaa aaccaaacgt aacaccaacc gccgcccaca ggacgtcaag ttcccgggcg 420 
gtggtcagat cgtcggtgga gtttacctgt tgccgcgcag gggccccagg ttgggtgtgc 480 
gcgcgactag gaagacttcc gagcggtcgc aacctcgtgg aaggcgacaa cctatcccca 540 
aggctcgcca gcccgagggt agggcctggg ctcagcccgg gtacccctgg cccctctatg 600 
gcaatgaggg cttggggtgg gcaggatggc tcctgtcacc ccgtggctct cggcctagtt 660 
ggggccccac ggacccccgg cgtaggtcgc gcaatttggg taaggtcatc gataccctca 720 
cgtgcggctt cgccgatctc atggggtaca ttccgctcgt cggcgccccc ctagggggcg 780 
. ctgccagggc cctggcgcat ggcgtccggg ttctggagga cggcgtgaac tatgcaacag 840 
ggaatctgcc cggttgctcc ttttctatct tccttttggc tttgctgtcc tgtttgacca 900 
tcccagcttc cgcttatgaa gtgcgcaacg tatccggagt gtaccatgtc acgaacgact 960 
gctccaacgc aagcattgtg tatgaggcag cggacatgat catgcatacc cccgggtgcg 1020 
tgccctgcgt tcgggagaac aactcctccc gctgctgggt agcgctcact cccacgctcg 1080 
cggccaggaa cgctagcgtc cccactacga cgatacgacg ccatgtcgat ttgctcgttg 1140 
gggcggctgc tctctgctcc gctatgtacg tgggagatct ctgcggatct gttttcctcg 1200 
tcgcccagct gttcaccttc tcgcctcgcc ggcacgagac agtacaggac tgcaattgct 1260 
caatatatcc cggccacgtg acaggtcacc gtatggcttg ggatatgatg atgaactggt 1320 
cacctacagc agccctagtg gtatcgcagt tactccggat cccacaagct gtcgtggata 1380 
tggtggcggg ggcccattgg ggagtcctag cgggccttgc ctactattcc atggtgggga 1440 
actgggctaa ggttctgatt gtgatgctac tctttgccgg cgttgacggg ggaacctatg 1500 
tgacaggggg gacgatggcc aaaaacaccc tcgggattac gtccctcttt tcacccgggt 1560 
catcccagaa aatccagctt gtaaacacca acggcagctg gcacatcaac aggactgccc 1620 
tgaactgcaa tgactccctc aacactgggt tccttgctgc gctgttctac gtgcacaagt 1680 
tcaactcatc tggatgccca gagcgcatgg ccagctgcag ccccatcgac gcgttcgctc 1740 
aggggtgggg gcccatcact tacaatgagt cacacagctc ggaccagagg ccttattgtt 1800 
ggcactacgc accccggccg tgcggtatcg tacccgcggc gcaggtgtgt ggtccagtgt 1860 
actgcttcac cccaagccct gtcgtggtgg ggacgaccga ccggttcggc gtccctacgt 1920 
acagttgggg ggagaatgag acggacgtgc tgcttcttaa caacacgcgg ccgccgcaag 1980 
gcaactggtt tggctgtaca tggatgaata gcactgggtt caccaagacg tgcgggggcc 2040 
ccccgtgtaa catcgggggg atcggcaata aaaccttgac ctgccccacg gactgcttcc 2100 
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ggaagcaccc cgaggccact tacaccaagt gtggttcggg gccttggttg acacccagat 2160 
gct1:ggtcca ctacccatac aggctttggc actacccctg cactgtcaac tttaccatct 2220 
tcaaggttag gatgtacgtg gggggagtgg agcacaggct cgaagccgca tgcaattgga 2280 
ctcgaggaga gcgttgtaac ctggaggaca gggacagatc agagcttagc ccgctgctgc 2340 
tgtctacaac ggagtggcag gtattgccct gttccttcac caccctaccg gctctgtcca 2400 
ctggtttgat ccatctccat cagaacgtcg tggacgtaca atacctgtac ggtatagggt 2460 
cggcggttgt ctcctttgca atcaaatggg agtatgtcct gttgctcttc cttcttctgg 2520 
cggacgcgcg cgtctgtgcc tgcttgtgga tgatgctgct gatagctcaa gctgaggccg 2580 
ccctagagaa cctggtggtc ctcaacgcgg catccgtggc cggggcgcat ggcattctct 2640 
ccttcctcgt gttcttctgt gctgcctggt acatcaaggg caggctggtc cctggggcgg 2700 
catatgccct ctacggcgta tggccgctac tcctgctcct gctggcgtta ccaccacgag 2760 
catacgccat ggaccgggag atggcagcat cgtgcggagg cgcggttttc gtaggtctga 2820 
tactcttgac cttgtcaccg cactataagc tgttcctcgc taggctcata tggtggttac 2880 
aatattttat caccagggcc gaggcacact tgcaagtgtg gatccccccc ctcaacgttc 2940 
gggggggccg cgatgccgtc atcctcctca cgtgcgcgat ccacccagag ctaatcttta 3000 
ccatcaccaa aatcttgctc gccatactcg gtccactcat ggtgctccag gctggtataa 3060 
ccaaagtgcc gtacttcgtg cgcgcacacg ggctcattcg tgcatgcatg ctggtgcgga 3120 
aggttgctgg gggtcattat gtccaaatgg ctctcatgaa gttggccgca ctgacaggta 3180 
cg1:acgttta tgaccatctc accccactgc gggactgggc ccacgcgggc ctacgagacc 3240 
ttgcggtggc agttgagccc gtcgtcttct ctgatatgga gaccaaggtt atcacctggg 3300 
gggcagacac cgcggcgtgt ggggacatca tcttgggcct gcccgtctcc gcccgcaggg 3360 
993999^9^^ acatctggga ccggcagaca gccttgaagg gcaggggtgg cgactcctcg 3420 
cgcctattac ggcctactcc caacagacgc gaggcctact. tggctgcatc atcactagcc 3480 
tcacaggccg ggacaggaac caggtcgagg gggaggtcca agtggtctcc accgcaacac 3540 
aatctttcct ggcgacctgc gtcaatggcg tgtgttggac tgtctatcat ggtgccggct 3600 
caaagaccct tgccggccca aagggcccaa tcacccaaat gtacaccaat gtggaccagg 3660 
acctcgtcgg ctggcaagcg ccccccgggg cgcgttcctt gacaccatgc acctgcggca 3720 
gctcggacct ttacttggtc acgaggcatg ccgatgtcat tccggtgcgc cggcggggcg 3780 
acagcagggg gagcctactc tcccccaggc ccgtctccta cttgaagggc tcttcgggcg 3840 
gtccactgct ctgcccctcg gggcacgctg tgggcatctt tcgggctgcc gtgtgcaccc 3900 
gaggggttgc gaaggcggtg gactttgtac ccgtcgagtc tatggaaacc actatgcggt 3960 
ccccggtctt cacggacaac tcgtcccctc cggccgtacc gcagacattc caggtggccc 4020 
atctacacgc ccctactggt agcggcaaga gcactaaggt gccggctgcg tatgcagccc 4080 
aagggtataa ggtgcttgtc ctgaacccgt ccgtcgccgc caccctaggt ttcggggcgt 4140 
atatgtctaa ggcacatggt atcgacccta acatcagaac cggggtaagg accatcacca 4200 
cgggtgcccc catcacgtac tccacctatg gcaagtttct tgccgacggt ggttgctctg 4260 
ggggcgccta tgacatcata atatgtgatg agtgccactc aactgactcg accacrtatcc 4320 
tgggca'tcgg cacagtcctg gaccaagcgg agacggctgg agcgcgactc gtcgtgctcg 4380 
ccaccgctac gcctccggga tcggtcaccg tgccacatcc aaacatcgag gaggtggctc 4440 
tgtccagcac tggagaaatc cccttttatg gcaaagccat ccccatcgag accatcaagg 4500 
gggggaggca ccrtcattttc tgccattcca agaagaaatg tgatgagctc gccgcgaagc 4560 
tgtccggcct cggactcaat gctgtagcat attaccgggg ccttgatgta tccgtcatac 4620 
caactagcgg agacgtcatt gtcgtagcaa cggacgctct aatgacgggc tttaccggcg 4 680 
atttcgactc agtgatcgac tgcaatacat gtgtcaccca gacagtcgac ttcagcctgg 4740 
acccgacctt caccattgag acgacgaccg tgccacaaga cgcggtgtca cgctcgcagc 4800 
ggcgaggcag gactggtagg ggcaggatgg gcatttacag gtttgtgact ccaggagaac 4860 
ggccctcggg catgttcgat tcctcggttc tgtgcgagtg ctatgacgcg ggctgtgctt 4920 
ggtacgagct cacgcccgcc gagacctcag ttaggttgcg ggcttaccta aacacaccag 4980 
ggttgcccgt ctgccaggac catctggagt tctgggagag cgtctttaca ggcctcaccc 5040 
acatagacgc ccatttcttg tcccagacta agcaggcagg agacaacttc ccctacctgg 5100 
tagcatacca ggctacggtg tgcgccaggg ctcaggctcc acctccatcg tgggaccaaa 5160 
tgtggaagtg tctcatacgg ctaaagccta cgctgcacgg gccaacgccc ctgctgtata 5220 
ggcrtgggagc cgttcaaaac gaggttacta ccacacaccc cataaccaaa tacatcatgg 5280 
catgcatgtc ggctgacctg gaggtcgtca cgagcacctg ggtgctggta ggcggagtcc 5340 
tagcagctct ggccgcgtat tgcctgacaa caggcagcgt ggtcattgtg ggcaggatca 5400 
tcttgtccgg aaagccggcc atcattcccg acagggaagt cctttaccgg gagttcgatg 5460 
agatggaaga gtgcgcctca cacctccctt acatcgaaca gggaatgcag ctcgccgaac 5520 
aattcaaaca gaaggcaatc gggttgctgc aaacagccac caagcaagcg gaggctgctg 5580 
ctcccgtggt ggaatccaag tggcggaccc tcgaagcctt ctgggcgaag catatgtgga 5640 
atttcatcag cgggatacaa tatttagcag gcttgtccac tctgcctggc aaccccgcga 5700 
tagcatcact gatggcattc acagcctcta tcaccagccc gctcaccacc caacataccc 5760 
tcctgtttaa catcctgggg ggatgggtgg ccgcccaact tgctcctccc agcgctgctt 5820 
ctgctttcgt aggcgccggc atcgctggag cggctgttgg cagcataggc cttgggaagg 5880 
tgcttgtgga tattttggca ggttatggag caggggtggc aggcgcgctc gtggccttta 5940 
aggtcatgag cggcgagatg ccctccaccg aggacctggt taacctactc cctgctatcc 6000 
tctcccctgg cgccctagtc gtcggggtcg tgtgcgcagc gatactgcgt cggcacgtgg 6060 
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gcccagggga gggggctgtg cagtggatga 
accacgtctc ccccacgcac tatgtgcctg 
tcctctctag tcttaccatc actcagctgc 
actgctccac gccatgctcc ggctcgtggc 
tgttgactga tttcaagacc tggctccagt 
ccttcttctc atgtcaacgt gggtacaagg 
ccacctgccc atgtggagca cagatcaccg 
tggggcctag gacctgtagt aacacgtggc 
cgggcccctg cacgccctcc ccggcgccaa 
ctgaggagta cgtggaggtt acgcgggtgg 
ctgacaacgt aaagtgcccg tgtcaggt.tc 
999tgcggtt gcacaggtac gctccagcgt 
tcctggtcgg gctcaatcaa tacctggttg 
acgtagcagt gctcacttcc atgctcaccg 
agcgtaggct ggccagggga tctcccccct 
ctgcgccttc cttgaaggca acatgcacta 
tcgaggccaa cctcctgtgg cggcaggaga 
aaaataaggt agtaattttg gactctttcg 
aagtatccgt tccggcggag atcctgcgga 
tatgggcacg cccggattac aaccctccac 
tccctccagt ggtacacggg tgtccattgc 
cacggaggaa gaggacggtt gtcctgtcag 
tcgccacaaa gaccttcggc agctccgaat 
cctctcctga ccagccctcc gacgacggcg 
ccatgccccc ccttgagggg gagccggggg 
ccgtaagcga ggaggctagt gaggacgtcg 
gcgccctgat cacgccatgc gctgcggagg 
actctttgct ccgtcaccac aacttggtct 
ggcagaagaa ggtcaccttt gacagractgc 
tcaaggagat gaaggcgaag gcgtccacag 
cctgtaagct gacgccccca cattcggcca 
tccggaacct atccagcaag gccgttaacc 
aagacactga gacac€:aatt gacaccacca 
aaccagagaa ggggggccgc aagccagctc 
gtgtgtgcga gaaaatggcc ctttacgat:g 
gctcttcata cggattccaa tactctcctg 
ggaaagcgaa gaaatgccct atgggcttcg 
tcactgagaa tgacatccgt gttgaggagt 
aagccagaca ggccataagg tcgctcacag 
attcliaaagg gcagaactgc ggctatcgcc 
gctgcggtaa taccctcaca tgttacttga 
tccaggactg cacgatgctc gtatgcggag 
ggacccaaga ggacgaggcg agcctacggg 
ccccccctgg ggacccgccc aaaccagaat 
ccaatgtgtc agtcgcgcac gatgcatctg 
ccaccacccc ccttgcgcgg gctgcgtggg 
ggctaggcaa catcatcatg tatgcgccca 
atttcttctc catccttcta gctcaggaac 
acggggcctg ttactccatt gagccacttg 
gccttagcgc attttcactc catagttact 
gcctcaggaa acttggggta ccgcccttgc 
gcgctaggct actgtcccag ggggggaggg 
gggcagtaag gaccaagctc aaactcactc 
ccagctggtt cgttgctggt tacagcgggg 
gaccccgctg gttcatgtgg tgcctactcc 
tccccaaccg atgaacgggg acctaaacac 
cctttttttt tttctttttt tttttttttt 
tcctcttttt ttccttttct ttcctttggt 
tgtgaaaggt ccgtgagccg cttgactgca 
caagt 
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accggctgat agcgttcgct tcgcggggta 6120 
agagcgacgc tgcagcacgt gtcactcaga 6180 
tgaagaggct tcaccagtgg atcaacgagg 6240 
taagagatgt ttgggattgg atatgcacgg 6300 
ccaagctcct gccgcgattg ccgggagtcc 6360 
gagtctggcg gggcgacggc atcatgcaaa 6420 
gacatgtgaa aaacggttcc atgaggatcg 6480 
atggaacatt ccccattaac gcgtacacca 6540 
attattctag ggcgctgtgg cgggtggctg 6600 
gggatttcca ctacgtgacg ggcatgacca 6660 
cggcccccga attcttcaca gaagtggatg 6720 
gcaaacccct cctacgggag gaggtcacat 6780 
ggtcacagct cccatgcgag cccgaaccgg 6840 
acccctccca cattacggcg gagacggcta 6900 
ccttggccag ctcatcagct atccagctgt 6960 
cccgtcatga ctccccggac gctgacctca 7020 
tgggcgggaa catcacccgc gtggagtcag 7080 
agccgctcca agcggaggag gatgagaggg 7140 
ggtccaggaa attccctcga gcgatgccca 7200 
tgttagagtc ctggaaggac ccggactacg 7260 
cgcctgccaa ggcccctccg ataccacctc 7320 
aatctaccgt gtcttctgcc ttggcggagc 7380 
cgtcggccgt cgacagcggc acggcaacgg 7440 
acgcgggatc cgacgttgag tcgtactcct 7500 
atcccgatct cagcgacggg tcttggtcta 7560 
tctgctgctc gatgtcctac acatggacag 7620 
aaaccaagct gcccatcaat gcactgagca 7680 
atgctacaac atctcgcagc gcaagcctgc 7740 
aggtcctgga cgaccactac cgggacgtgc 7800 
ttaaggctaa acttetatcc gtggaggaag 7860 
gatctaaatt tggctatggg gcaaaggacg 7920 
acatccgctc cgtgtggaag gacttgctgg 7980 
tcatggcaaa aaatgaggtt ttctgcgtcc 8040 
gccttatcgt attcccagat ttgggggttc 8100 
tggtctccac cctccctcag gccgtgatgg 8160 
gacagcgggt cgagttcctg gtgaatgcct 8220 
catatgacac ccgctgtttt gactcaacgg 8280 
caatctacca atgttgtgac ttggcccccg 8340 
£i9cggcttta catcgggggc cccctgacta 8400 
ggtgccgcgc gagcggtgta ctgacgacca 8460 
aggccgctgc ggcctgtcga gctgcgaagc 8520 
acgaccttgt cgttatctgt gaaagcgcgg 8580 
ccttcacgga ggctatgact agatactctg 8640 
acgacttgga gttgataaca tcatgctcct 8700 
gcaaaagggt gtajctatctc acccgtgacc 8760 
agacagctag acacactcca gtcaattcct 8820 
ccttgtgggc aaggatgatc ctgatgactc 8880 
aacttgaaaa agccctagat tgtcagatct 8940 
acctacctca gatcattcaa cgactccatg 9000 
ctccaggtga gatcaatagg gtggcttcat 9060 
gagtctggag acatcgggcc agaagtgtcc 9120 
ctgccacttg tggcaagtac ctcttcaact 9180 
caatcccggc tgcgtcccag ttggatttat 9240 
gagacatata tcacagcctg tctcgtgccc 9300 
tactttctgt aggggtaggc atctatctac 9360 
tccaggccaa taggccatcc tgtttttttc 9420 
tttttttttt ttttttttct cctttttttt 9480 
ggctccatct tagccctagt cacggctagc 9540 
gagagtgctg atactggcct ctctgcagat 9600 
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